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PROTEASE 
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(vi) CURRENT APPLICATION D#TA: 
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(B) FILING DATE: o/- JUL-1997 
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(vii) PRIOR APPLICATION DATA: 
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(A) TELEPHONE: (703) 816-4000 
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[2) INFORMATION FOR SEQ ID NO: 1: 

(i) /SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3018 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

r (ii) MOLECULE TYPE: genomic DNA 

(ix) FEATURE: 

(A) NAME/KEY: misc_f eature 
(D) OTHER INFORMATION: /label= FIGURE 8a 



(xi) 



SEQUENCE DESCRIPTION: SEQ ID NO: 1: 



TGATAGGTGC TTGTAAACTG TGCTTAACGA AAACATACCG TGTGCTGTAG GGACTTAAfZfT 
CTTGTTTATA TCAGTTAGCC TGGTTTCGCT AACAGTACAT CATTTTGCTT AAAGTGACAG 
CTTACGAGAA CCTATCGATG ATGTTAAGTG AGGATTTTCT CTGCTCAGGT GCaCTTTTTT 
TTTTTTTTAA GACGGAGTCT CTTTCTGTCA CCTGGGCTGG AGTGCAGTGG /CGTGATCTGG 
GTTCACTACA ACCTCTCCCT CCTGGGTTCA AGCAATTCTT CTGTCTCAGC CTCCCAAGTA 
GCTGGGATTA CAGGCACCCC CCGCCACACC CGGCTTATTT TTGTA2TTTTT AG TAG AG AC A 
GGGTTTCACT ATTGTTGTCC ATGCTGGTCT CGAACTCGTG ACQTCATGTG ATCCACCCGC 
CTCGGCCTCC CAAAGTGCAG AGATTAGAGA CGTGATCCAC ^TGGCCCAGC AGGACCACTT 
TTTAGCAGAT TCAGTCCCAG TGTTCATTTT GTGGATGGQG AGAGACAAGA GGTGCAAGGT 
CAAGTGTGCA GGTAGAGACA GGGATTTTCT CAAATG^GGA CTCTGCTGAG TAGCATTTTC 
CATGCAGACA TTTCCAATGA GCGCTGACCC AAGAACATTC TAAAAAGATA CCAAATCTAA 
CATTGAATAA TGTTCTGATA TCCTAAAATT TTAGGACTAA AAATCATGTT CTCTAAAATT 
CACAGAATAT TTTTGTAGAA TTCAGTACCT yCCCGTTCACC CTAACTAGCT TTTTTGCAAT 
ATTGTTTTCC ATTCATTTGA TGGCCAGT^G TTGGGTGGTC TGTATAACTG CCTACTCAAT 
AACATGTCAG CAGTTCTCAG CTTCTT^CCA GTGTTCACCT TACTCAGATA CTCCCTTTTC 
ATTTTCTGGC AACACCAGCA CTTCATGGCA ACAGAAATGT CCCTAGCCAG GTTCTCTCTC 
TACCATGCAG TCTCTCTTGC TCTCATACTC ACAGTGTTTC TTCACATCTA TTTTTAGTTT 
TCCTGGCTCA AGCATCTTCA JSGCCACTGAA ACACAACCCT CACTCTCTTT CTCTCTCCCT 
CTGGCATGCA TGCTGCTGg/ AGGAGACCCC CAAGTCAACA TTGCTTCAGA AATCCTTTAG 
CACTCATTTC TCAGGAGAAC TTATGGCTTC AGAATCACAG CTCGGTTTTT AAGATGGACA 
TAACCTGTCC GACCt/cTGA TGGGCTTTCA ACTTTGAACT GGATGTGGAC ACTTTTCTCT 
C AG AT G AC AG AAT^ACTCCA ACTTCCCCTT TGCAGTTGCT TCCTTTCCTT GAAGGTAGCT 
GTATCTTATT TTCTTTAAAA AGCTTTTTCT TCCAAAGCCA CTTGCCATGC CGACCGTCAT 
TAGCGCATCT yGTGGCTCCAA GGACAGCGGC TGAGCCCCGG TCCCCAGGGC CAGTTCCTCA 
CCCGGCCCAG AGCAAGGCCA CTGAGGCTGG GGGTGGAAAC CCAAGTGGCA TCTATTCAGC 
CAT CAT GAG C CGCAATTTTC CTATTATCGG AGTGAAAGAG AAGACATTCG AGCAACTTCA 
CAAGAAATGT CTAGAAAAGA AAGTTCTTTA TGTGGACCCT GAGTTCCCAC CGGATGAGAC 
CTCj/cTCTTT TATAGCCAGA AGTTCCCCAT CCAGTTCGTC TGGAAGAGAC TCCGGTGAGT 
AOCTTCCTGC TTGCTGGCTG GGTTTCCCCC CCACGGAGGA GTCCTCTCAC TCAGCACCTC 
Zggcagctca GCTGTGCACA TGGGCACTGG GGGAAGGATC CTGGCAGCAG CTCTGCTGGG 



CTCTGTCTTT AAGTGTGAAG CAGGGAGGAG AGGAACAGGT CTCAGATATT TCACCAAATC, 
TCAGCAAAAT CCAGAGGGAG AGCGCAGGAG GTGGGGTGAT TCTTATGCTC TGGCTCTOTC 
TCTCTGAAAA AAAAAAAAAA ATCTTGCTTT TTATAAAAGT GGGTGGAACT CAGTT/TAATT 
CATCCTGTAA AAATAAATAT TCCTTTCTCA GAACAAATTC CAGACAGCCC AG^TGTACCT 
GTTCGTTTTA ATATTATTCA TCTTGGTAAG ATTATTTCAG TTTCTCTGGC yTAAAATCATG 
ATGTTATTCT TCTTTAATTT ACCAATGGCC ATTCTTTCTG AAACACAC^A AC C C TAG AAA 
GAGAAGAGTC ATAGGCAAGG AATTTTTTTC ATGCATAAAA TGTTGGGGTT AAAGAGAGAG 
AGACCTAGCA ATCGCTTTGG TCCACCTACC TCACCTCATA AGT^AGGAGT CAAGGCACAC 
TAGAGTGAAA TATATCTAGT GGGCACATGA CAGAGCCCGG A£TAAAACTT TGTTTTAGGA 
AACTCTCCCA GCCTCTGGGT TTCATTTACA GTGATCGCCa/gGAGGGAAAT CACATTCCCC 
TGGCTCACCT CTCTGATCAT CCCTCCAGTG TGACTCTJGT TCTTAATTCG AGAAATATTT 
ATTGAGCATC TACTAGTGCC AGCACTGGGC AAGCA^CTGG GGGGACAGCA GTGAGTAAGA 
AAGACCAAAA TTCCAGCTGT CTTGGAACCT AGG^TCCTGA AGGGAAGATG GGCATTGAAC 
AAGAGTGACA TTGTCAGGAG ACGATGTTCT QGGTGCCACA GGATCATGTG GCAAGGAGAG 
CTAACCTGGT CCAGGGAGAC AAACCCTCTc/tGAGGAAATG ATGACAAGCT GAGACCCAAT 
ACTATTGATT AGCCATGGTT TTCTTTAACC TAAGGTGGGC CAGGCATGGT GGCTCATGCC 
TATAAACCCA GCATTTTGGA AGGCCOAGGC TGGAGGATTG CTTGAGCCCA AGAGTTAGAG 
ACCAGCCTGG GCAACAGGGT GAA5ffi.CC TAT CTCTTTTGTA CTAAAAATTC AAAAAATTAT 
CCAGGCATGG TGGCACATGC CTGTGGTCCT AGCTACTCAG AGGCTGAGGT GGGAAGATCA 
CTTGAACTCG GGGAGTTTGA /GGCAGCAGTG AGCCGAGATC ATGCCACTGC ACTCCAGGCT 
GGGTGACAGG AG T GAG AC / 

(2) INFORMATION EOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11451 base pairs 

(B) /TYPE: nucleic acid 
(c/ STRANDEDNESS: double 
(m TOPOLOGY: linear 

(ii) MOLECULE TYPE: genomic DNA 

(ix) FEATURE: 

/ (A) NAME /KEY : misc_f eature 
/ (D) OTHER INFORMATION: /label= FIGURE 8b 



(*i) SEQUENCE DESCRIPTION: SEQ ID NO: 2 



4 



GATCCACCCG CCTTGGCCTC CCAAAGTGCT 
CCGACACTGC CCTAACTCTC AAGTTGCATC 
CAGCATGGGA CAATGTAAAA AG GAG G CAT G 
GTCTTTGCAC GAGTTTCTTA ACCTCTCTGG 
TGATAGTATT CCCTTCACAG GGCCAAATGG 
TCAATAAATA ATAGCTACTG CGGCCGGGCG 
TGGGAGGCCG AGGCGGGTGG ATCACAAGGT 
GTGAAACCGT ATCTCTACTA AAGATACAAA 
AGTCCCAGCT ACTCGAGAGG CTGAGGCAGG 
TCAGTGAGCC AAGATTGCAC CAGTGCACTG 
AAAAAAATAC CTATCTATCT ATCTGTCTAT 
TTTGTTTCAC AGGAAATTTG CGAGAATCCC 
GACATCTGTC AAGGAGAGCT AGGTAGGAAA 
AAGGGGTGAT TACAAGGTGT GATCCCCTTC 
TCCAGTAACT TTTTGGAAGA TTTTTTATAA/ 
TGGCGATTGC TTCATTTCCT CCTACATGTCC 
GTATCTGCAT CCTGTGGCCT CCTCTjZjCAGT 
CATGACAAAA GCCCTGCTTT TCACTGTATC 
GCACCAAGCA TGCCCCTTGG GCATGGAGAT 
GGAAAGCACT TGTAACTGG^f ACCCTTGGTT 
AAGAGTCTTT CCACAAAGAT GGCATCCGCC 
TTGGTTAGCT TGACTGCTCC ATCTGATCTT 
TTGTCTTTGG TGTGGACTAT AAGCAAGCTC 
GAAACAATTT AAATTTGAGG AAAAGGGGGC 
ATTCCAGAAG CSGGAGGCTGA GAATAAATCA 
AAGGCTTCC]/ GCAGAGCCCT GGGCATAATA 
TCCAAGGAAA GACTGGCTGC TTCCAAGGAG 
TCAAGTOTCC CCTTGCACAC TCTCAGGTTG 
GAAGGCAGTT CTTTGTGACC AGGGTACACC 
GAG&GAGAGA GAGAGAGAGA GAGAGAAAGA 
TACTCTGTCA GAAAAGAGGT TCAGAGAATA 



GAG ATT AC AG GTGTGAGCCA CCACGCCCAG, 
CTTACTCGAA TAGTATGACA GTGTGGGAAG 
TTTCTGGCTT CTGCTACTTA CTAGCTGTGT 
GCCTCAGTTT CCTTATCTGA AAAATAACAA 
AATACTATCA GGAACACTAC Z*TAATGGAAC 
CGGTGGCTCA CATCTGTAAtf CCCAGCACTT 
CAAGAGATGG AG AC CATC CT GGCCAACATG 
AATTAGCTGG GCAT^GTGGC GCATGCCTAT 
AGAATCACTT GAACCCCGGA GGCAGAGGTT 
CAGCCTGGCG/ACAGAGTGAG ACTCCGTCTC 
CTACTGTTAT TCTTACCTGG TCATTTCCTT 
CGATTTATCA TTGATGGAGC CAACAGAACT 
GTGGCTCAGG TCAGATCCTG CCAGATGATC 
C^GGAGGTAA AGGGACAATC TGTGCTTGCT 
CAGTTGCTTT ATGGTCGTTT ATCTACATGC 
TCTTTAGCAC TCTGCCATGC ATCACAGGGG 
ATCTCAAGGA CACTTACATA. CCCCACTCAG 
GTCTTTCTTG GAAGACAGCT CTGTGACTGT 
TCTAGATACA CACACAAAAG GCATCGCCAA 
TAAATTGGCC CAGCATAGCT CCATCTTTAA 
ATGTGGATGA GCATCCAATT TTCTCTTTGA 
CCTCTCTCTC GACCTCTTGT TCAGAAAGTA 
TGTGAAGTAA AATTGGAGAG AACACCAACA 
ACCTAAGACC AAAGGAATTT GGCTTATTTC 
GATGAATATC TGGGTTCCTG CACCTGAGGG 
ATCTGGGACC TTCAAACCAA TAACCTCTTT 
GGTAGGGGAG AGTCGGGCTG CAGGCAGCTC 
GCATTTTCAC TTTAACCCAT CCTCCCTTAA 
CCCTATTATA TATATATATA CACACACAGA 
GAGCAAAGTG TTACCTCCAA CTACATACAG 
AGAAAACGTC CCGAGCTCAT TCCGTTGCCA 
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GCAATGTCTT ACTGCCCCCT ATAGACGGGT TCCAGGGCAG CTGCCTACCT GGCCTTCCpf 
CCAATACAAA TCATCTTGGT GGATGGTTCT CTGAGGCTCA GTCTTCGCTG AAGTCAGAAG 
AGGAATTGGA CTCACATTGC AAAGGCACAG GGCAGGGCAG ATTTCCTACA GGT^TTAGGA 
AGAACAACCC AG T TAT GAT C ACCTACTGCT CTGTCTCCAT TGAGGCCTAA &AAGGAAGTG 
AGTTTATACT GCAGTTGGAG GAACTGCCTG CAGCCTTGAG GAAAATGTCT AGTCACAAGG 
GAGTAAGTTA CCTGTTGATC ATATTGTCAA GGAATTCCTG TCCAATTCTC CTTCCCTGGG 
TTGACACCTC TGTAAGGTCA GATCTGGAAG TAGGAGAGTG GGCA^AAGG GAGTCCCCGT 
TCAGGGAAGT GGAGTGGCTG GCTGGGATTG GGGCTTTTTC TT£CCAGGAG GAGCAGGAGT 
GCTCACGATC TGTGCCCTGT GTCTGCCTGC AGGGGACTGC /TGGTTTCTCG CAGCCATTGC 
CTGCCTGACC CTGAACCAGC ACCTTCTTTT CCGAGTCA^A CCCCATGATC AAAGTTTCAT 
CGAAAACTAC GCAGGGATCT TCCACTTCCA GGTGAGGTAA TGAGAGTGTA GTTAAGAGGG 
CCAGCGGCAG GCCACCCACC GCTGGTCTCC TGGOCTTGAC TTCCCAGAAG CTGGAGGAAA 
CTTCCCACCC ATCTACCCGC AGCGGCAACA G^CGGCATGG ACCCCCTTAA GGCTTCAAGC 
CTGGGAGGAA GCAGTTGCTT ATCTCTGGCt/cCCTAATCCC TCCCCCACCA CCTTCCACTA 
TGTCCCAGAA AGACAGGAAG ACATCCTGTT TACTGTGGGT CTATTTTTGT CTTTGCAGCT 
GTCTGGCTGC TTTTATTGCC TGCAGQCCTT CTCAAGTAGG TCCCTAAGAT ATTAGCACTG 
TGACACCACA GGACCCTTCA GGTTCTACAG GAACCCCTGT CCAGGGCTCC TGTATACTTC 
TTCCTCTCTA AGGCATGGCG G^ACCAAGGC TATCACTCCT CTCTTCCAAG CCCTGGAAGA 
AGAGTCTGCT TAACCTGGGG/aTCAGGCTTC TTGTTTGCCC TAGAACTGAA TCTGATGGTT 
CTAGAATCCA TCCAGCTACT GGAAATTTTC TGGGTCCCAG TCACCTTGGC ATAGAGCTGG 
TGCTAGAGCA GAACCA^ACT GAATTCTACC TGTGAGGGTC TCGTAGCTTC CGGGATGCTG 
GGGAGTCAGC CTGTCTCCAG CTTCAAAGGC TCCCTCATGT CCCAGGATGA CCCACATTAT 
CAGTTCTTGC TGCCCGGGTC TTGCACCTCA GCACGGAAGG CCTCAGAAAA GGTCTGTCTC 
CAGGCTCAGA yCTCCCCCTCC TGCCGCCTTG GGAACATGGC ATATTTAAAG GGTCTCAGAT 
CTAAAGGGCC TTACATACAA ATATCAGATA GATTTCTGTT CTCATTTCAA TGAGGGAGAA 
agtgcca/tg AAAAGGAGAC TAAACCACAT TTGGCCCTTT TCAGTTCAAA CTGATTCATT 
CAAAAAAGAG CGACATCCAA ACTTGAAATG ATTGAACAAT GTTCCTGCTA CAGCTAGAAT 
AGATTCTGGG TCACTTTGTT CCTCCGTTTC AATCCTTGTT CTTCAGTTTG GCATCAAGAA 
ATACCTAAAT CAGCACAGTG CCTTCACTGC ATAGTTCCCA ATCCTGGCCA CATTGAATCA 
GOTGGGGGCA CCTGAGAGTG CTGACACCCA GGCCCTGCCC CAGACCTGCT GAGCAGGAGA 
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ATGAAAATCT TACATCCTAA GACACTCATG GAGCACCTAC TCTACCCATT ACTGGGCTGG 
ACTCTGTGGA AGACATGAAG TATATGTAAC TCACTTCCAG CTCTCAAAAA GCACc/aGTC 
CAGTTAGAGA CAGATTTACA CACCCCAAAC ACAAAATAGG ATGAACAGGC AOZCAGATGC 
AGAGTCCAGG AAATGATGCT GCTTTGGGAT TCAAGAACCC CCTGAGGAAyGTGGAGGAAG 
GACACATTTC CTAACAGTAA TTTGAGTATG TGACTCTGTG CGTGACGc/t CTGTGCAGTT 
CTGGCGCTAT GGAGAGTGGG TGGACGTGGT TATAGATGAC TGCCTJZGCAA CGTACAACAA 
TCAACTGGTT TTCACCAAGT CCAACCACCG CAATGAGTTC TGGAGTGCTC TGCTGGAGAA 
GGCTTATGCT AAGTAAGCAA CACTTTAGAA TGTGAGGTGG GGCTAGAGGT GAGAAAGTGG 
GTTGCAAAAT CCAGCCGAGA CCTCACTCAC AGGAAGAGGC ATGTGCCTCT ATACGTGCAT 
ATGTGTGGGC ATGCAAGTCC AACTGTGACC CAAAGTT&GA GATCAGTTCC AGGCAACAAC 
AGCTCTAACT AAAAACATTA AATTTAAGAG TAGA/ATGAA GATTTGCATA GAAGACCTTT 
AGCTTTAGCT CACCATAGCG AGTTCTTTCA TT^CACCTCC ATGGTGGCAT TGCAAGTCTT 
GGGATCAGAG CATTGTCCCA GGGTCTCGAT /gGCTCAACC TCATGTGCTT ATAGAAGATT 
TATAAAGACA TGTTGTCTCT CAACTTAM& GCTCCACCCC AGATGATAAT AATGGATTTT 
CAAATTTTGG AACAAGGTCA CTCTGTJftTG CAGGCTGGAG TGCAGTGGTG CAGTCACGGA 
TCACTGTAGA TTGACCTCCT GGGT^CAAGG TGCTCCTCCC ACCTCAGCCT CCCAAGTAGC 
TGGGACTACA TGCGGGCATC AC/<ATGGCCC TTTTATTTTT GTATTTTTTT GTAGAGCGGG 
GTTTTCCCAT GTTGACCCAG /cTGTTCTCG AACTCTTGGG CTCATACAAT CCACCAGCCT 
TGCCCTCCCG AAGCGCTGGG ATTGCCGGTG TGAGCCACCA CACCGGCAGC TGCTAATGGC 
TTTAATGCAG CCCTTCOTCA ACGTTCAGGA TGTAGTGGAA AGAGCTCTCA GGAAGTGGGG 
ATAGCTGGGT TTCAa/cCCA GTGCTTCTGG CTCTCTGTGG TCTTGGGTGG GTCACTTAGC 
CTCTTGAGCT CAg/tTCTTC ATTATGAAGA AAGGGAATCA TTGTTTCCAT CCCATGAGCT 
CATAGGGTTA a/gTGGAATT GATGAAAGAA CATCACAGCA TCCAAGAGGT AAAGTTCTGG 

tggcagtggt/acctgggttt tgttccctgg aactctgtga ccccaaattg GTCTTCATCC 

TCTCTCTASK GCTCCATGGT TCCTACGAAG CTCTGAAAGG TGGGAACACC ACAGAGGCCA 
TGGAGGACTT cacaggaggg GTGGCAGAGT TTTTTGAGAT cagggatgct CCTAGTGACA 
tgtacAgat CATGAAGAAA GCCATCGAGA GAGGCTCCCT CATGGGCTGC tccattgatg 

taaQtctggg gtgtggggca cagggtgggg agctccaagt gtcaggaagc cttttaccca 
at/aagggca gcatagagct tttgtgtggg acagagcgaa tgttttgttt gaggaagcag 

(3AACTGGCTC TCAACTTTGA GGACTGGGAA TTTCTCAAGG GAGAACAGTT CTTCCGGATT 

/TTCAATAAAG ACACTGGTCA aggacatttc aagccctgga atgtcagtgg aaatcagtcc 
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AGAGGCCTGT GTCAGTGGAG GCCTCCCTTG CTGGTGCTCC TCAGTCTCAG CACGCTCCCA 
TTAAGCTGGC CACGTACTTG GCTGTGGACC TGAGCCCACC ATTTCCCTAA GAAAI^CCTCC 
CAGTCACTGG GCTTTCACCA CACCTCCCCG CTTGAGACGT GGGCTTTGTG TTGTTACCTG 
GGAGAAGCTA AGCCTGCAGC ACCTTTCAGT GCAAAGAAAT GCTGTGAACT/gAGACAGGAG 
CCAAGGGTAG GGAGATGGCC GCCCATGGCC AGGCCTCCTT CAGGGGGGAT GCCTTCCCTG 
AGGGCTGCTC AGTATATTGA TATGATAATC TTAGTGGTTT CCATT^GGGA GGATGGGGCT 
GAAGCTGAAT TCCTGCCCCT TCTTCTCCCA ACACGCCCAA TGGACAGCTT GGAAGGTCAG 
T TAG C AC AC A ACACCATGGA TGAACTTTTT TTCTGTATCA OTTTTCTCCG TCTTTCCTCC 
ATTCGTGCTC TGTTGATCTC TCCTCTCTCC CTTTGTCTg/ CCCATCTCTT TCTCCTCTCT 
CCTTCCCTTT CCACCCTTCT GTGTTTGTTC TCTCCCTtZCC CTGTGTTGTT CCCTACATTC 

tccatcgggc ctcaggatgg cacgaacatg accta/ggaa CCTCTCCTTC TGGTCTGAAC 
ATGGGGGAGT TGATTGCACG GATGGTAAGG AATATGGATA ACTCACTGCT CCAGGACTCA 
GACCTCGACC CCAGAGGCTC AGATGAAAGA OCGACCCGGG TGTGTACACC TCCGATTATC 
AGAACTGACC ATCCCTCCAA CCCACATGAC? CCCGCCCTAT TAGTGTCAGA CTCCCCTCAG 
CAGCCAGGGC CTTACCCACA CACCCCCJTCC TGGCACCTCC CAAGGGTCTG GGTTGAAATA 
ACTTGCTCAG CCAAGGCTCC TGAAG£GGGT GCAAGAACCA GGATTTTGGA GGGAATCTCT 
GCTGGAGTTT CTGCATATTC CATGGTCCAG GCAGTTCCTC TCATAACGAA C TAT C AG AC A 
GAAATACTTG TAAAGATACT TCATTTATTT TGAAATATTT TTCCTCTTCT AATGTATTCA 
TTTATTCATT CAACACTTAT/ TTTTGAGCTC CTACTATGTT CCAGGCACTC CTCTAGCAAA 
CAAAGCAAAT TCTCTCCDCT TTTTCAATAT TTGTGGAAAA AGCAAGGTCT CCCTCTTGTA 
GAGTTTATAT TCTAGmTTT TCATAAGTTA TACCTGCTCA CTGGAGAATA CTGAGCCATA 
CAGAAAAACA CAGAGGAAAA TTTCACTTAT ATTTTTCCCC ATGTAAAGAT AACCACTCTT 
AACATCTAGT ATilTGTTCTT CCAGGATTTT TCTATGCACA CACTGAATCT GTATTTTTAT 
TTTTAAAATG /tATCATATT GTATGTACCT CTTTGCAGCC TGCTTTTTTC AGTTAGTTTT 
TTTGGTTTTy TGGTTTTTTT TTTTTTTTGG AAACCAAGTC TTGCTCTATT CCCTAGGCTG 
gagcacag/t GTTGCCATCT CGGCTCACTG CAACCTCTGC CTCCAAAGTT AAACTAATTC 
TCCTGCOTCA GCCTCCCGAC ATAGCTGGGA TTACAGGCAC ACACCACCAC ACATGGCTAA 
TTTTTSTATT TTTTAGTAGA GACGGGGTTT CACCATGTTG GCTGGAATGG TCTTGAACTC 
CTGACCTCAA GTGATCCACC TGCCTCAGCC TCCCAAAGTG CTGGGATTAC AAGTGTAAGC 
CAOCACACCC GGCCTAGTTT GATATTCTTA ATGTGCCCAA AGTATTCTCC TGTAACATTT 



TTTAATAGCT 
TATTATTGGA 
AAATGCTATA 
AGGACAGATT 
AGAGTCATTT 
CCCCTCCCAG 
GATCTGGGAG 
ACAGCTCTAG 
GTTAGTAGGC 
CCCTTCTCCC 
CTCCTTAGGC 
CTGGGCACTG 
AGGGGATGTA 
ACTTTGGGAG 
AACATGGTGA 
TGCCTGTAAT 
AGAGATTGCA 
TCTGTCTCAA 
TTAACCCCTT 
GGGGAAGGGG 
GATACTAAGG 
CCACATTGCT 
TGTGTCTCTT 
TTCAGGGAGC 
TGCCTCCAAG 
TACAGACAATy 
ACGCCTAc/c 
CACCCTGCTG 
GCAGAGCTTG 
GGGGGfCTGTG 
TTTTCGTGCT 



ACACAATATT CAAACACACA GATATGTTAT AATTTATTTA CCCAATACCC 
AAGTTGAGTT CTTTTTTTTC TTTGTTTTGT TTTGTTTTGC TACT^TTCTA 
ACGAACATCC CAATAGATAC ATCTTTGTAT ACATCCATGG TGACTTCCAT 
CCCAGCAGTA GAATTGCTGG GTTGAATGAT ATGCTTAGGG yiAATGACAGA 
CAAGCAGCTT CCTAGGGTCT TAGAACTTAA GGATTAAT^A GTCTTCCCGC 
TCTATTCAGC ATGATCTGGA T CAT GAG G AC TGAGATCTGG AAGAGACTGA 
AGGCTGAGAT ACCAAAAGCC CTGGCTCCAC CCA^ACCCCT CGCCCTGAAA 
GAATTCCGCG GCCTAGCAAG GCTCCGGGAA GCTCCTTTTA AAGCTGTGAC 
ACATGGACCA TAGAGACCTA TCCAGGGCTC/aTGGGACTTT AGTGATCCTG 
AAGGATCCCC CATGGCTGCA ACTTGGAA&T TTCTGCAAAT GGAAGAGCTA 
ACGGTCATGT CTGAGCAGGG ATCTCOTCGG GCTTTCTTAG ' AATTCTCTCC 
GGACTCTTGA TTTCTTGAAT ATT^TGTTCC AGGTGGGTGT GGAGGAGGTG 
AAGAAGGCTA GACTTGGCCA GGCGCAGTGG CTCATGCCTG TAATCCCAGC 
GCTGAGGCGG GTGGATCACC /TGAGGTCAGG AGTTCGAGAC CAGCCTGGCT 
AACCCCGTTT CTACTAAA^A TACAAAAAAT TAGCTGAGCA TGGTGGCACG 
CCCAGCTACT CGGGAGGjCTG AGGCAGGAGT ATCGCTGGAA CACGGGAGGC 
GTGACCCGAG ATCGTCCCAC TGCACTCCAG CCTGGGCGAC ACAGCAAGAC 
AAAACAAAAA AG^AAGAAAA AAAGGAAAAG CTAAGACTTA CATGTGTCAC 
TTCTCAAACC TCTTTCTCTT CCAGGAATAG TCAACCCCTG GATGGCTTCA 
GATCCTGAAc/ CCCAGGGCAG CCTCCAACTC TACCCCTTCC TCCTTTGAAG 
GGTCCAGAAA GGAGGGGCAG GACACTGTTA CCCACCCCAC ATCCCAGCAT 
CTCTGATGGT CAGGACAGAG CCTTCTCAGG GAGACCAGCC TGTCTGGAGC 
GGCACTCTTA AAGGGCCACT GAAGGTCCGT TCGTGGTCGT GAGGCACACT 
ag/gtggtct GTGTCTTCAC AGAGCCCGGA AAATGAACTA GTATGAACTT 
OAGCAGAACT TCTGTTCCCC CGCCCCTAAT GGGTTCTCTG GTTACTGCTC 
/ CATTCCGGTT CAGTATGAGA CAAGAATGGC CTGCGGGCTG GTCAGAGGTC 
TGTCACGGGG CTGGATGAGG TAAGCCTGGT GGGGCTTGGT GGGGCAAGGG 
GGTTAACCTC ATGAAGTCAG GACTTAGCTG TTGGGGCCCC TGCCCTGTCT 
CCTCCAATCA GGACATTCAG TTCAAGGTCC AAGCCACGCC TGGGAGCAGA 
AAACTGGTAG AGGTGGATCC TGCCACAGTT GGTGCACAGT TTATCTTTGC 
AAAGATGGCA ATTTTTCCAA CATTTCCAAT GAACAAATTG AAATATCACT 



A 



TAACTTTGCT 
AGATAGCTAC 
CAAAATCTGC 
TGCACATCCT 
GAAATGCTAT 
TTGTATTATT 
GCCATGGATA 
TGGACCATGT 
CCCTGTACAG 
GGAGGGAGTC 
TTCCTTTTTT 
GCACAGTCTC 
CCTCCTGAGT 
TAGTAGAGAT 
TCCGCCCACC 
AGCAGGGGCC 
CTCCTGTTTA 
TTGCCTCTTA 
GGTCTGACCC 
CAGGCAGAAC 
ACAGAAGATT 
GTCCCGTTCA 
GAGTGGAACG 
GGGGAACAGG 
CTTGGTATAA 
CTCCAGGG. 
CCATTGGCfG 
ATGAATGCAG 
TCCTGjGAAA 
CTGGGACTTG 



TTTACAAAGT 
AGTTGTCTCT 
AGATGCTCAA 
CCCATATACT 
GTAAATAGTT 
ATTTTTTCTT 
CGAGAGGCTG 
CTGAGACAGG 
AGGGATGGGC 
TGCTTGTTTG 
TTTTTTTTTG 
GGCTCACTGC 
AGCTGGGATT 
GGGGGTTTCT 
TCGGCCTCTC 
TTTTTTCTAA 
TGCCTCACCT 
GAAGATAGAG 
ACCCCCTGCC 
ACCCTCGCG 
CCCTTTCOAG 
AAGGTGAGAA 
GTTOTTGGAG 
GTCfCGGGACA 
.TCACCCTC 
GGGCCAGGAG 
GAAAGGAAGG 
GGTTCTGGGC 
TGGGAACAGT 
TTTTTGGGCA 



TGGTTTCATG 
GGGTAGCCAC 
GTCCCTTCTG 
TTAAGTCATC 
ATTGCACTGC 
TTTTTGAATA 
ACTGTTCTGT 
AACGTTGTAA 
TGAGAGGGGC 
TAGTTCCTCA 
AGACGGAGTC 
AATGTCCGCC 
ACAGGCGCGT 
CCATGTTGAT 
AAAGTGCTOG 
TTTATATCAA 
CCTCGCCCGA 
AGGfAGATGCC 
TCCCCAGC 
AAGAGATTTG 
AGAGGCTGCA 
AGTGAAGCTG 
TGATAGGTAG 
AGGCTGTGTT 
AAAACCAATG 
TGGAAGCGGG 
ATTCCAGAAA 
TAGAGAAGTG 
ATTATTAGCA 
AGTGCTGAGC 



TGTTCTTGAG 
GGGGACTGGT 
TAAAATGGAG 
TCTGGATTAC 
ATTGGGTTTT 
TTTTTGATCC 
TTTGCTCCTT 
GACCTGTTGC 
AGTTGCCTGC 
GTCAGCAGGG 
TCACTGTGTT 
TCCTGGATTC 
GTCACCATGC 
lAGGCTGGTC 
GATTACAGGC 
GACACCTAAT 
AGCTCATACG 
AAGCCTAAGT 
ACACTTGTGA 
CCCCCCAGCC 
GAGCATGAGA 
GTGCGGCTGC 
GTGAGGGGAC 
GGGAACTGAG 
ATCCGCAGAG 
GTGCTGGGGA 
GCGTGGGGAA 
ACTTCCCTTC 
CTTACCTTGT 
CATTGCTAAG 



CTTCCTGTTC 
TCCAGAAGCC 
TAGTATTTGC 
TTACGATACC 
TTTGGTATJTA 

acaattcgtt 
ctggtcacttc 
m!acagttgg 
atcacccatt 

GCCTTTTGTC 
GCCCAGGCTG 
AAGCGATTTT 
CCAGCTAATT 
TCGAACTCCT 
GTGAGCCACC 
TTATATGTGT 
GCAGGATGTT 
TAGGCAGACT 
TTAATCTCCT 
CCGTCCCAGC 
GCTCTTTCTG 
GGAATCCGTG 
CCCACGGGAT 
CCATGAGAGT 
AAGAGGGGCA 
CCCAGAGAGG 
GGTCCAGGCA 
TTGGGGTCTT 
GGGCTGATAT 
ATTCCCCTTA 



TCTCGTOTTC 
CCAM^AGTAA 
AmTAACCTA 
'AACACAATG 
TTTTCTGTTG 
ATATGCCAAA 
TGGGTTTTCC 
GCAGGTTGTG 
GCAGCAGACT 
TTTCCTTCCT 
GAGTGTAGTG 
CCTGCCTCAG 
TTTGTATTTT 
GACCTCGTGA 
ACGCCTGGCC 
TAGCAAAGCC 
CCTGAGAAAA 
CAGGAGGATA 
TGGCCAGAGC 
CCTCAGCTAG 
TGTGCTTAAG 
GGGCCAGGTG 
TGGCGGTGGC 
ATTGAAGATG 
CAGGTGTTGG 
TTGCTGACAA 
GGAAAAGCGT 
GTGTTGCCTT 
TGAGGAGTAA 
CCCGTGCTTG 



TCCCTTGTAT TAAGGCACAA GGGCCCTTTG AAAAGAATTT TACCTGCTTT 
AGGGATTAAG ACCTTGGGGG CCAACCCAAA ATAAACATGC GAACTTATTA 
CCATGCACAC TTCGTAAAAC CTCCATGGTC CTACTGGTTC CTGATTACCT 
AGAGGCAATT CATTACTGAA TGAGCCATAA GCGCCTCTTA TTTCGAGAGG 
GACTCAGTCG AGGAGAAGGA CCGCACCCAG GCAGCCTGGG CCCCTCGGCT, 
TTTACTGCTG GGTACTTCCT AGCCCAGCAT GTAATTACTG GTTCGTTCAG 
AGTAAATGTT TCTTGGGCAC C TACT AC AT A GGAGGCACAG GTCAMGCAC 
CTTTCTACCC ACCCCCTCCC TTGATACACT GTGATTAGGG ACT^ACCGAT 
(2) INFORMATION FOR SEQ ID NO: 3: / 

(i) SEQUENCE CHARACTERISTICS: / 

(A) LENGTH: 1834 base pairs / 

(B) TYPE: nucleic acid / 

(C) STRANDEDNESS: double / 

(D) TOPOLOGY: linear / 

(ii) MOLECULE TYPE: genomic DNA / 

(ix) FEATURE: / 
(A) NAME /KEY : misc_f eat^re 
(D) OTHER INFORMATION :/label= FIGURE 8c 

(xi) SEQUENCE DESCRIPTION^/ SEQ ID NO: 3: 

ATTTTTTTTT TTTTTTTTGA GACGGAGTCT CACTCTGCCA CCCAGGCTGG 

CGCGATCTTG GCTCACTGCA ACC^CCGCCT CCCGGGTTCA AGTGATTCTT 

CTCCTGAGTA GCTGAGACTA TA&GTGCCCG CCACCACGCC CAGCTAATTT 

ATTAGGACGG GGTTTCACCA yTATTGGCCAG GCTGGTCTCG AAATCCTGAC 

GCCCACCTCG GCCTCCCAAA GTGCTGGGAT TACAGGTGTG AGCCATTGCG 

AACTCAATTC TTAACCT>TTA AAGTATGATG AGAAGAAGGA TCAAGCCCTC 

TTAAGGAGTT TAGGGTCAGT CTTGAGGATG TGAGAAGTCA TTGCTATTGG 

GAGGTTAACA GGTGAAGTCA GCATTTTGGT AGTTCACAGC AGCTGCAACT 

CTCTGATACC l/cTGTCCCA ACCTACATCA GGCCTTCCCT TCTTCCTGCT 

CTCCATTTTC/CCACCAGATG GAAGGACTGG AGCTTTGTGG ACAAAGATGA 

CTGCAGCACC AGGTCACTGA GGATGGAGAG TTCTGGTGAG TCCAGAACCC 

AGAAGGC^TAA GGGTGGGGAA GAGAGGGGAA ATCTCAGACC TCAGTCCCCA 

TCAGATTCCA GCCCTTGGGA GATCTTGGCT GTGTTCTCCT CCAGCCCAAG 

gat/aggttc tgagaggagc CTTCCAGGCC ACAGGGACAA TGAGCCCAGG 

CAfrGACATGG CTCTTGCCTC CTGTGTGCCC CTCCGCCACA CACTCTATTC 



ATCAATTC 
TTTAT^GGCT 
CCAOTCAATG 
GtfGATGGCAG 

'cctgtactta 
tcattcgttt 

TGGGGATATT 
C 



AGTGCAATGG 

CTGCCTTAGC 

TTGTATTTTT 

CTTGTGATCC 

AGCAGCCCAG 

ACCAGCCCAT 

GTTTCACACT 

CTTTGTATTT 

TCCTTAATTC 

GAAGGCCCGT 

AGGAAGACCC 

GCTAAGGTTA 

GCCCAGCAAG 

ACCAGGCCAA 

CAGCCACAGG 




CACCCTGGCC TTAGCACAAT TCTTTTCTGA GCCTAGGAAG 
CAACGTCAAC CTCACCCTCT CTCAGGTTGT TTCTATTCAG 
GAGAATTTTC AAGTCTCAGC TTAAGGAGAG CCCCCTAAGT 
TTTATGATGC TCATCACCCT TAAAATTGTT TGCTTAAGCC 
GTAATCCCAG CACTTTGGGA GGCCGAGGTG AACGGATCAC 
ATCTTGGCTA ACACGGTGAA ACCCTGTCTG TACTAAAAAT 
CGTGGCAGCG TGCGCCTGTA GTCCTAGCTG CTGGGGAGGC 
AACCTGGGAG GCAGAGGTTA CAGTGAGCCC AGATTGCGCC 
ACAAGAGAGA CTCTGTCTTG GAAAAAAAAA AAAAAATGTG / 
GAAAGGTTTT GGGTGTTTTT ATTACTTTAT TTTTTAT7TA 
CCTCGCTATA TTTCTCGGGC TGGTCTCAAA CTCCTJ^GGCT 
GCCTCCCAAA ATGCTGGCAT GTGGGCCTGG TCAACATATG 
ATTTTAAAAT TAGCCAGATG TGGTGGCGTG "pGCCTGTAGT 
AAGCAGGGGG TCACTTGAGC CCAGGAGGTt/ GAGGCTGCAG 
CTTTTCTTCT GAACGTGAGA TTAAGTG^AG TCAGCAATTT 
GAATTTTTAA CCGTCACGTT GCGGC^AACC AGGT 
(2) INFORMATION FOR SEQ In NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: A4 664 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY : linear 

(ii) MOLECULe/tYPE : genomic DNA 

(ix) FEATURE: 

(A) /NAME/KEY: misc_f eature 
(DVOTHER INFORMATION: /label= FIGURE 8d 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

AGGAGGTGGA GGTTGCAGTG AGCCAAGATC ATGCCACTGC ACTCTAGCCT GGGCAACAGA 

GCGAGAC/CT GTCTCAAAAA ATACACACAC ACACACACAC ACACACACAC ACACACACAC 

ACACAOATAT ATATACACAC ATATATATAC ACACACATAT ACACACACAC ACGTCTGTAT 

ATATATGTGT GTGTGTATAT ATACACACAC ACACTATTCT ATATATTCTT GTAGAGCTAT 

gtc/tgtctcc TGTGCTATTG AGCATGAGCC CTTTTTTTTT TTTTTTTTTT TTGAGACAGA 

G/FCTCACTTT GTCGCCCAGG CTGGCATACA ATGGCGCAAT ATCGGCTCAC TGCAACCTCC 



CTCCACTTAC 
GCTTCAAGTC 
TCCCCGAGGA 
GGGCGCGGTG 
GAGGTCAGJ^A 
ACACi 

TGA^GCAGGA 
TGCACTCC 
GTCTTAGTTT 
AAAACTATAA 
CAAGCGGTCC 
GGACCCCAAC 
CCCAGCTACT 
TGAACTATGA 
GGCTTAGGAT 



CCTGATCTTC 
TCAGJZTTAAG 
CTGGGATTAA 
;CTCACGCCT 
GATCGAGAAC 
AGTAGCCGGG 
GAATCACTTG 
AGCCTGGGCG 
AATGTCAAGG 
T AG AG AC GGG 
TCCCACCTTG 
TCTACAAAAA 
TGGGAGGCTG 
TTGTCGTTCA 
TATTTATTCA 




GCCTCCTGGG 

CCCGCCATAA 

GCCAAGCTGG 

CTGGGATTAC 

TAACTGTATT 

CAGCCTCTGG 

CACATATGAG 

TGTTAGAAAA 

AGGAAAGACA 

CTGTGTTAGT 

TCTCTTCTTC 

GGAGATCTGC 

GTCTGTGAAC 

AGGTGGGAGA 

GCCTAACTAG 

CCTGTTTTAC 

AACTTTGGAA 

TATGTCCAGA 

TGCCTAACCC 

CTTGACCTTA 

AAGTTGTTGT 

AGAATGGTTG 

GTAAAGTTAA 

TAACAGTAAG 

GAT GAT G AAA 

GGGAGAGCA' 

GGACCTGACA 

TAATCCXAGC 

CAGC^CAGGC 

GAGGATTGCT 

CQTGCCTGGG 



TTCAAGTGAT 
TGCTCAGCTA 
TCTCAAACTC 
AGGCATGAGC 
TTTGTATCCA 
TAACCACTGT 
TGAGAGCATG 
GATGATGGTT 
GGCTCCTCTG 
CCTGGGGTCT 
CAACCTCTCA 
AACCTCACGG 
GAGGGCCGCT 
TGCTCTTGAT 
TGCTTATTAA 
TGAGAAGGAA 
GCAGGAACTT 
TAAGCCCATC 
CCCAAACCTC 
AGCCTAAAG 1 ] 
AATATCTOCA 
TACGGGTACT 
TGTAG 
GGtACTATTG 
fccTAGACCA 
ATTGGGTGAT 
ACATTGCAAC 
ACTTTGGGAG 
AACATAGTGA 
TGAGCTCGGG 
TGACAGAGTG 



TCTCCTGCCT 
ATTTTTGTAT 

ctagcctcag 

cacagcaccc 

ttagccaccc 

ctgctctcta 

cgacatttat 

tggagtagat 

ggaccctgac 

tccgttccca 

ggatgtccta 

ccgatgctct 

gggtacgggg 

ggggggaggg 

gtctctctgj 

accacca/gc 

gtgggaacaa 

CATJCTTTTGA 
'AGCTTACC 
TGGGCCAAAT 
TGTAACTTAC 
CGAAATCCAG 
CCGAACCATC 
GAGAACCAAG 
AGTCAGTAGC 
GTAGGGAAGG 
ATAAGACACA 
GCAGAGCCAG 
CACCTCATCG 
AGGTTGAGGC 
AGACCCTGCC 



CAGCCTCCCA 
TTTCAGTAGA 
GTGATCCACC 
TGGTGAGCAC 
TCTTTTCATC 

cttccatgac 

ctttctggcc 

acatcagaag 

caagttcctg 

gccctcctc, 

tgaggat/tc 

gcagtctgac 

TTGCTCTGCC 
'CTAAGCCGA 
TCCAGACGTC 
TGAGAAGTTT 
TGCAGATGCT 
AAATACCCTA 
CTGGCCTACC 
CATCTAACTC 
TTAATACTTG 
TTTCTACTGA 
CTAAGTCAGG 
TTAGCAGCTG 
AGCAGAGATG 
AAGAAGAATG 
CAAGAAGATC 
GAGGATCACT 
TTACCCAAAA 
TACAATAAAC 
TCAAAAAAAA 



AGTAACTAGG 
GATGGGGTTT 
TGCCTCAGCC 
TAGAGCTTAT 
CTCCCCTCT* 
ATATGCTTTG 
CTGGCXCATT 
TGACAGCGTT 
SAACTATTT 
CCTGCTCCCA 
ATCTACCATT 
AAGCTTCAGA 
GGAGGCTGCC 
AAAAGTTCCA 
CACTATCTTA 
GCAATAGGGA 
GCTTGGACTT 
AGTGAAAAGT 
CTCAAACATT 
CAAAGCCTAT 
TACCTAAAAA 
ATGTGCATCT 
GACTGTGAGT 
CTGCAATAGT 
GAGGGGAGAC 
ATGTCAAGAT 
GGGTGGGTGG 
TGAGCCCAGG 
TAAAAAAAAA 
TGTGATCATG 
AAGACACACA 



ATTACAAGTG 
CACCATCTTG 
TCC^AAAGTG 
T^CTTCTATC 
CTTCCCTTCC 
TTTTAGCTCT 
TTTGAATCAT 
TGCCCTAAAA 
TATTATTGTG 
TATGGCTCTC 
TCACAAAGTT 
CCTGGACAGT 
GCAACTTCCC 
GGCAGAAGAA 
TTAAACCTTC 
GCTGGGTAGC 
AC GAT GAG GT 
GCATCCAATA 
GCTCGGAACC 
TTTACAAAGA 
GTGAAAAACA 
CTTTCACATT 
ACTGTGTCAG 
TCAAGTCAGA 
AGCAGATTTA 
TCCCAGTTGG 
CTCATGCCTA 
AGTTCAAGAC 
AATGAGGTGG 
CCACTGCACT 
AGAGAAAAAT 




ATCAGCGTGT TGTTTGTTTT TGGTGGAGTT AATTGTGGGG TTCTAGGGAA AGGAAOTTAG 
CTTGGGACAT GGAAAGTTTG AGGTTCCTGT AGAGTGTCCC AGTGAAGATT TGT^ATAGAG 
CATCGGATGC GC AT AT TAG A TGGCACTTGG TGATATGATA AGAACTCAAA ^AATATTTGA 
GGAATAAAGG AAAGAAGAGG CCAGACGTGG TGGCTTATGC CTGTAATC^C AGCACTTTGG 
GAGGCTGAGG CAGGCGGATC ACTTGTGGTC AGGAGTTCGA GACCA0UTTG GCTAACATGG 
TGAAAACCCA TCTCTACTAA AGATACAAAA ATTAACCGGG GA^ATGGTG GGTGCCTGTA 
ATCCCAGCTA CTTGGGAGGC TCAGTCAGAA GAATCGCTTG/AACCCAGGAG GCGGAGGCTG 
CAGTGAGCCG AGATCGCGCC ACTGCACTCT AGCCTGGG^A AC AG AG CC AG ACTCCGTCTC 
AAAAAAAAAA AAGTGAGAGA GATTGAGGCT GGGAT^TATG GCTCAGGCAT CATGCGCGTG 
TAGGGGGCAG TTAAAAAGCA GAAGTAAGAA AGa/tGCCTA GGGAGGCAGG AAGGGTGAGG 
T GAG AG GAGA AGAGGCCCAG GACCAGATTC TpAGTCACCAA CAGCGTTTAA GGGGCAGGTA 
AGGAAAACAA AACCATCAGC AAAGACTGAf/ AATGAAAGCC CAGAGAGGAA GGAAAAGCCA 
CACATACAAT CAGTACAGCT CCATCTG&YT AAAGGTAGCG CCCCCCCCCC CCCAAATCAT 
TAGAGAAATG CCTGATTCGG TTTTOTGTGG ATTTTTCCTA AGAACCTAGA TGTGGGGAAT 
AGAAATAAAT GGTTCCCTCT GT^TCATCCC CTCCCTGCCC TCTGAGAGGA AGCTGTGATT 
GCGTGCTCCC TTTCTGGGGG /gCAGATACT TTCTGGACCA ACCCTCAGTA CCGTCCGAAG 
CTCCTGGAGG AGGACGATGA CCCTGATGAC TCGGAGGTGA TTTGCAGCTT CCTGGTGGCC 
CTGATGCAGA AGAACCtfGCG GAAGGACCGG AAGCTAGGGG CCAGTCTCTT CACCATTGCC 
TTCGCCATCT ACGA^GTGTG TAGTCCTGAT TGGCTCCAGC CCAGGAAACA TACTTTCCCA 
GAGAGGACGC TT^CAGGGGC TTCTAGAGGG GCCCTCTGCT TCCTCAATAC CAGTGACCCA 
CAGAGCTCCT ^GTATCAGGA CCACTTGTGT TTGTAACAAG CAAAAAATAC CAGGGGGGGC 
ATTAGAGAGCS CAGTGGAGCG GGCCTGGCAG AACAGGTGCC TGGGGGTCAG GCTTCCGCAT 
GCGGGCTGfcA GTTGCTGGCA TTGCCTTCCG CAGGCTCCTC ATCCTCATTC ACATCTGAAG 
CATCTTCCTT TCTGTTTCTT CTCAAGGTTC CCAAAGAGGT ATAGCAGCAG CAGCGGCCAG 
cagt/gtgtg CAGCACTACC CAGGGGGGCC CGAGTCTGTC TGTGGCTCGT CGAGAAGCTT 
cc/ggtgggg TTTGTGGGCA GGACTTGTGA TAGGAGAGGG CCTTGCCTGT TGTTATTTCC 
OACTTGCAGA GCAGGTTGCC TCAGGGCATT GCATGACCCA TGACTACCAC CCCCAGGATG 
/TGCACTTTCT CCCTCGCACC AGACACTGCA CGTCACACAC ATGCCTTTGC ACACTCACCC 
TCCTCCACGC TTACAGCCAC ACACACAGTC ACACAGACGC GTTCTGAGGG TGGCTGCCCG 
CTTGGGATGG AGGAATCACT TCCCTCAGAA CCCAGCCAAG TCCTCTAGGC CTCCTTGGGG 



gtccttccag 
tgtggggatg 
ggtagggagc 
ggtgaccagg 
cccgaggggc 
acctgcagaa 
acatgcggga 
ccacctacga 
acctctctga 
actccagagg 
tcctgagcca 
gagcacctac 
agaaagaaat 
aaaaacatat 
agagggatag 
catcctgtcc 
tggaagatgc 
aatactaaca 
tggttcctgg 
aggaagccgt 
agaccccaca 
cagtggtgag 
tgcaggggac 
tgtgctgagc 
ggccttaag 1 
ctggaaaoga 
gagccgcagc 
agaa^gtaag 
tg^cctggac 
ataatgtgcc 
ctggcacagc 



CCTGAGGGGC 
ACACATTTCC 
CTATTTAACC 
GAGTTGGGAA 
TCATGTGCCC 
GGACTTCTTC 
GGTGTCCCAG 
GCCCCACCAG 
GTGAGTGCTG 
TTGAAGGCAT 
CTGGCCACAT 
TATGTTCCAG 
CCCTGCCTTC 
GTAGCATGTT 
GAGGTGTTGG 
CTGGCCACCA 
ACCAGGTCCA 
AGAACCTGCG 
GGTTGGGGT 
GCACCAGAGC 
TGTCTGTATT 
TGGETTAGAT 
AGATGGTGCA 
.GTCCCTCCT 
ACCGGGGGCC 
GAAGCAATTT 
GAACACTGGA 
CTGGTGCCGG 
AATTTCTGTA 
TTGCAAGGCT 
ATGGGCACTC 



TTCGGAGCTG 
ATTCACTCTG 
CTTGGGAGTC 
GGGACCCTTG 
TGGGCTCTCC 
CTGTACAACG 
CGCTTCCGCC 
GAGGGGGAAT 
GCCCAGCTTT 
GAGGCAGCTA 
TACCCCCATT 
GCACTGTCCT 
ATGGAGCTTA 
AGATTTGGA 
GGATGCTTCA 
CAGATJ^AGCT 
TGGGTAGGTG 
CCTGGGCT 
TTCCAGGGGT 
AAACCGTCCA 
CCTCACAGGG 
CTTCTGTGCG 
GGGGAGAATG 
TGGCACTGCA 
ATTGAGGCAG 
GAACAATCGG 
TTCTGAGACT 
ACCTGGTGTT 
ATCCCTCTCA 
TTTGTGAGGC 
AAACAGAGGT 



AGGACAGCTG 
AATCACAACA 
GGGAAGTAGG 
GAGGTGGCTG 
CCATCTCTCA 
CCTCCAAGGC 
TGCCTCCCAG 
TCATCCTCCG 
CCCACGTGTT 
GACACGTQTC 
CATTCATTCA 
AGG^ACTAAG 
'ATTCTAAC 
AGGTGATATG 
AATTTTAGGT 
CATAGCCCCT 
GCTGGGTCAT 
TGGCTGTCGG 
TCTCTAGAGG 
CGGGCCTCCT 
AAGTTGAAAA 
AAAAGTCCAG 
GGCACTGGCA 
AATCCTACTT 
TTCAGGGGCT 
AGGGAACAAG 
GGATAACATT 
GACACTTGGA 
CTCAGTTTCC 
TTCATCAATG 
GCTTTTTCAC 



TTCTGGTAAG 
GAAAAGGGAA 
GAGGTTGAAA 
TGGCAGGACA 
GATGCACGG 
CAGGAG 
CGAC^ACGTC 
GGTCTTCTCT 
TCTAAAAGCT 
CTCCAGGGTC 
TCCATTCTGT 
GATAGAGTAG 
ATGAGACAAT 
GAGCAAAAAT 
TAGCATGGCC 
GCCACTCTGA 
GCCTTTGGGG 
GGATGGTGCT 
CTGGTTCTGG 
GCTTGCTTCT 
TACCATCTCC 
AGGGTCCCCT 
GAGGGAATGG 
TGGCATGGCC 
GGGAAATATG 
GCCACAGGAA 
GGATTTCACA 
TCCTCCACTT 
TACTCAGTAA 
AGGTGATGTA 
ACTTTACACC 



TGTCCCTGj 

gaggaa/tga 
ctg7gacatg 
jacgttcct 
aacaagcagc 
acctacatca 
atcgtgccct 
gaaaagagga 
cacatggccc 
cttctgctgc 

GATATTTATT 
TGAAGTAAAC 
AATGGATAGG 
AAAGTAGGGA 
AGGAAAGCCA 
TCTCTGTCCT 
GGCTCTGAGC 
GACATGGGGC 
CTTGGCTGCC 
GGTGACACTG 
GTGGATCGGC 
TCCCTGACCA 
GAGTCTGGGC 
AGAAGTAATC 
GAAGAGGGTC 
GGGATGACAA 
CATAGAGAAA 
ACCAGCGGGG 
AACGGGGATG 
TGTGAAGTGT 
TTACAAGGTA 



CTTTTCACAT 
CCCTGGTGTT 
CAGAGCTCAG 
GGTTTTTCCG 
GCACAGAGTG 
TTGGGCTGTA 
GGACAAATGC 
CTGAGCCAGT 
GTAGGTGCGT 
CAGGACCTGC 
GTGCAGGGTT 
TTGGGCTCCA 
ACACAGGGAG 
GGGCAGCCTG 
CTCAAAAATT 
AGCCATGTTT 
ATTTTTGAGA 
CTCACTGCAA 
CTGAGATTAC 
GGGTTTCACC 
TCAGCCTCCC 
TTTTTTAAAA 

gtaagagtaj 
aagaaga#tt 
tttctg6acc 
ctgg?ggttc 
cag&aaaaga 

TC2CATATGTA 
^AGACTTGCC 
TGACAACTAC 



GTGTCATCGC 
CAAGAAAGGA 
TAAGTGGCAG 
CACCTCCACG 
CTGTGTGTTG 
GTCAGCTGAC 
CCCTCTGAAC 
GCCAGGTCTC 
GGGGATCTGT 
AAACCCAAAA 
TTTGATGTCC 
GTGTCGAGGG 
CAGGGCCCTT 
CCACAAGCTG 
TTTAAAAAAT 
TTGGTGTTTT 
CGGAGTTTC 
CCTCCGCOTC 
AGGTGC?CCAC 
ATGJTTGGCCA 
lGTGCTGG 
.GGAAGAAA 
ATTATAAAAA 
CTCTTCTCCC 
CTGGAGCCCC 
TGAGAACTTA 
AAAAAACCAA 
TGTGCATGCA 
TCTTCCTCCC 
GATTTGCTGG 



GATACTTGCA 
AGCAGAGGCT 
GGTTTGGAAC 
CTGAGGCCAG 
GGCTCTGTGT 
AGTCCTTTGT 
TGTCTTCTGG 
CAAGTGCCTT 
TCTGGTCATC 
GCTTATGGGA 
CTGCACTGAC 
TCAAACAAGG 
TGGCTCAAJ^C 
GGGCTT^TAC 
ATTOTGTAAG 
1TAACCAAT 
CTCTTGTCAC 
CCGGGTTCAA 
CATCACGCCT 
GGATAGTCCT 
GATTACAGGC 
GAAAACCTTA 
CAAAGTCAGA 
TTCACCCTCC 
ACCCCAAGCT 
CTTTTCACTT 
GGTAGGTGTG 
TGTGAAGTGT 
CCTCCTTCCT 
GGGAAGGCTA 



AGGTTGCTGA 
CAATGGGGTT 
TCACATTCAG 
CCCCAGGCAG 
GTTGAGGAGT 
GCTCTGTGGG 
GCAGTGACAG 
CTGAATGACi 
TGGATGCffGG 
GCTGGCACGT 
AC^GTTGTCT 
lTTTTGGGG 
TGATAGTTGC 
CAAAGAAAAT 
TCAAAATCCA 
TTCATTTTTT 
CCAGGCTGGA 
GCAATTCTCC 
GGATAATTTT 
GAACTACTGA 
ATGAGCCAGC 
GCCAGAAGAT 
GCAGTCACTG 
ATGCCCCTTT 
AAAGACCAGG 
ATTCTGCATT 
TGGGTAGAGA 
GCATGTGTGA 
GAGCTTCTGC 
CGTGCCAAGC 



gaggtagatg 
gaatgacttc 
actctctgac 
tgagaagcc 
cttgtg^ctg 
gatgacgtag 
:atggtcat 
acaggcgatt 
tcatcgggtg 
cacgtgagta 
gcagttctcc 
cgtgggccaa 
cgcagggatt 
ctccctatgt 
ttgttaggtc 
tat tat t tat 
gtgcaatggc 
tgcctcagcc 

TGTATTTTTT 
CCTCAGATAA 
ACGCCCGGCC 
CTTTTTCCTT 
GTGTCTGGGC 
TTGGCTCCAT 
ATACAGGGAA 
TACTGTTTCC 
GCATGAAGTG 
GCTCATATGC 
TGGGGCCGAG 
ACTCTTTTAG 



GGGTTA7&AT 
TCTGAGTTCA 
T(2!CAGACTTA 
AAAGTCCGAA 
CCTTGGGGCT 
GCCAATGGGA 
AATCCTGACC 
GGTTTTAGTG 
CAGTATTGAT 
GAGCAGGCAG 
AATTTGACAT 
ATCTGGGAAG 
ACCAGGCCCA 
TAAATGCTTG 
AGTTTGAGAG 
TTATTTGTTT 
ATGATCTCAG 
TCCTGAGTAG 
AGTCGAGATG 
TCCGCCCACC 
ACCAATTTCA 
GCCATATGCA 
AT G GAG GAGA 
GTGATTCAGA 
GCCACAACCA 
TTTTCTTATG 
TGTGTACTCA 
ATCCATGCAC 
CGTGCAGTAA 
GTGCTTTCCA 



TGATTAATTC 
AGAGGGAGAA 
AGAGCCAGTA 
GCCATCATAC 
CCATCACAGG 
CACTACCACA 
TTATAACAGT 
TCAAAGAAGT 
CCATGTCTGT 
ACACTTCAAG 
GGATGCTCCA 
TTTGCTCTGG 
GCAAGCCAAG 
TATCTGCTTG 
GGGCTGCTTT 
AATCCCCACA 
GGCACCATGC 
CTTGAAAATA 
CAGTATCTCA 
AAGGATGAGT 
CTGTCACCCA 
TGTTCACACC 
CACACCCAGC 
ATGGTCTTGA 
ACAGGCGTGA 
GATTCCAGO 
TTCAATCT 
GATGTGGGGT 
AAAACACCAT 
AGTCTACAAA 
TTTCACGTGC 



cttcctcaca 
aaggtacaga 
cttagagcca 
gccatcagat 
ggtcctccag 
ttccaggcac 
atctacaaag 
gaacttgccc 
ttactccaaa 
gcctgtgttg 
tggccacacc 
gatccccaca 
gcctctgggg 
agctgtccta 
tctgctccga 
gccttgcctt 
agcagccgct 
tcccttgttt 
tttgatcccc 
aaactgagg 
gcctggaotg 
attttoctgc 
taat/ttttt 
tc/cctgacc 
iccccctgc 
cagtgttctg 
actttcaggg 
ctgagggcca 
gtttactgca 
atacttgaca 
tgagctctca 



ACAGCCCTAT 
CTTGACTAAC 
GGCAGTCTGG 
TTGGTGCTAG 
GTACTGGTGC 
TGTGCTTGGG 
TAGGTGCTGT 
AAGGAACAGA 
ACCTGTGTTT 
TCCAGACCCA 
ATATCCATCC 
AGCTTCAGCT 

cctgctggga 
gatgagcagc 
gaggctctg 
cccccgg^tt 
ctccg/cctt 
gtotagcatc 
:aagagccc 
cagagaggat 
cagtggcttg 
ctcagcctcc 
gtatctttag 
ttgtgatctg 
ccggccagag 
atggctcacc 
ttgtagaggt 
agagctctgt 
ggaaatttaa 
atcactgcac 
atactctacc 



G AG AT TAG T A 
TTGCCCAAGG 
GTCCAGAGTC 
CATTTCTGGT 
TGGCCCAGAC 
GTCAGTCCCT 
TATTTTTCCC 
ACTAATGAGT 
CTTGCCCTCT 
CACTCGGG 
TACACAJCCC 
GCTTQAGCAA 

gcgaaagctg 
Itggaagggc 
ctgcccagtt 
tccctacagg 
ttcatatcct 

TTAAATGTTT 
TATGAGGAGG 
ATTTTTGGTT 
ATCTTGGCTC 
CAAGTAGCTG 
TAGAGATGGG 
CCTGCTTCGG 
AGGATATTTC 
CACTGACCAT 
TCCTTTGAGG 
TCTCATTAAT 
TTGGACAGTG 
TAGATCATGC 
ATGAGGAGGG 



CTATAACTAT 
CCACACAGCC 
CGTGTCCTGA 
GGTGCCTGGT 
CAGAGCTG 
CTCTTTyTTTT 
CTTTCACAGG 
GGGGAAAATG 
'TCTCTGATG 
TGCCAGTGTG 
CCCTCAGACT 
GACACTGCTT 
GGGAGCCGTT 
AGTGGTGCAT 
GTTCTCTGCA 
TGCACCGCAT 
TGTCACTTGC 
TTGCAGTATG 
GAAAGCAGAT 
TTTTTTGAGA 
ACTGCAAGCT 
GGACTACAGG 
GTTTCACCCA 
CCTCCTAAAG 
TTAATGAGGG 
TCCACTAATC 
TGCCTCAGTA 
CAGAGAAGCT 
TTTCCATCTG 
TGCTTTTAGC 
ATGGAGTGGG 



ccccatt/tc 
agagaggggc 
acgacaagag 

IGTGATGGAT 
ACTCCTCAGG 
TCCCCCCCAA 
T GAG AT AG AC 
GAACTGGAAA 
CCAGCCCCCT 
TGCCTGGCAG 
GTGACCTCCA 
AGAAGGCAGA 
TCCACGGGTC 
GAGTCCAGGC 
TTGCAGCCTC 
CCACAGTGTT 
AC GAG CAT GT 
ATTTTGCATT 
TTTACCATTA 
CAGTCTCACT 
CCACCTCCCA 
CACCCACCAC 
GTTAGCCAGG 
TGCTGGGATT 
GCAGGGCTGG 
CGTGTCCTTT 
CTTCCATGGT 
TGTGTTTTTA 
GAAAAAAAAA 
ATTCTTAGCA 
TATGAAAAGA 



TAAAGAACTG AAGTCACACG GCTTGTCAGT GGCAGAGATA GAGCTTGAAC CGAGGTTGAA 
GAGCTCCCGC CTATTCCTTT CCTCTTCTCA CTGGATAAAG CTGCTCCAAG AGAG0TGCTG 
CCTCAGTGTG CCTGTTCAGA CTGTAATCCT CCCTTCCTTC CTGCCTCCTC CJ2TCCTCTCT 
CCAGCCCATC ATCTTCGTTT CGGACAGAGC AAACAGCAAC hAGGAGCTGQT GTGTGGACCA 
GGAGTCAGAG GAGGGCAAAG GCAAAACAAG CCCTGATAAG CAAAAGCAGT CCCCACAGGT 
GTCTGGGCAT GTGGCATGGG TGGGGTGGCC AGCACGCTAC AGGGJ^CTTCC TATGCGCTTG 
G GAT AC AC AG GGGCTGGAGG CTTCCCAGGA GTTTGTCTTG A^CATCTGGA GGTTTGAATT 
TGTCCCACTG ACCTTTTCTT TCAGCAAGTT CCCCTGAAAT/tTGGGCTGCT GCTTGGGTGA 
ATATCCCAGG ATGGGGGTTC CATTCTAGGA GTGGACTGGC AGGCTGAGCC TCCCATGGAG 
CTGATCCAGC C AG GAT AC AG AGAAGGGGAG GCAAA^GCTG AGACAGAACC AGCTTGAGAG 
CGGAGGCGCA ACTCTTGTCT CCTGGTGGCC TTG^GCATTT CACAATAGGG GGATAAAGGA 
TAGGAGCAGA AAAGTGGGGC TGACTTCAGA AATGGGGTCC TCTAGAGCTC ACGGGAGGGT 
GTTAGATTGG AGTGGGAGCT TAGTGGAGGj/ GAGCCTTAGA GGCAAAAGTC TCCAGACCAA 
TCCAGGCCCC CTCTTCTATC CGGGGGC(2!CC TCTTCTATCC AGGGCCCCTC TTCTGTCTGG 
GAGCCCCTCT TCTATCTGGG GCCTCATGCA GTGGGGCCTA GGGGAGGTTC TCTGAGGACT 
TGGCCTTGAT GACAGGGTGG CTGGAGGAAT CAGAACGGTC AGACCTTCTT TGACCTGCGG 
GCACCTTTAG TTGGAATGCT GAGGCCTGGG ATGGTGGAGG GGGCTCTTGC AGGTGGGGAC 
TGGGGTGGCG GGGAGGAGGc/ TGTATGGCCG CCATATCTCC TTTGGCTGGG GGCGTCAGGG 
CTGGAGAGGT GTGAAGAGTC CCTGAGGCCT CGATGCATCT CACTCCAGCT CACCAGGTCT 
GCATTTGCCC GTCCCCAGCT CCTGCTGCCA CCCCCGGCCG TTTTAGGCAC TTGGCTCCCT 
TGGCCCAGAG GAGOTTGCCT CACAGGCCTG TGCACCTCTG ACCCCTGTGA ACCAGTTTTC 
CTTTGTGCCT Cc/cAGCCAC AGCCTGGCAA CTCTGATCAG GAAAGTGAGG AACAGCAACA 
ATTCCGGAAC ^TTTTCAAGC AGATAGCAGG AGATGTGAGT ACCTCCAAGC CCAGGACGCC 
CACAGGTGCy TCCTTCTCTC CTGGATTAAC TGCTCAGATT ACCAATTATT TCATTATTGT 
TTGGTAGA(SG TCACTTTGGA CTTCGGTGGA GCCAGGGGAT GTGTGCGTAG CACACAAATC 
CACAAGOCCT TGAGTTTTGG ACTGCCACGT CTGCTGGGGG GCTCAGAGGC CTTTTTGCTC 
tgagc/gccc ACGGTGGTCC TGATAGCTGA GGTGCAGTAT CTGGCCCCCT GTCTTCCTCA 
GAA^GCCCC AGCTTCCCAT GACATAATAG CACCGACAGG GATTTTACAA AC AC AG CC AG 
GTCSGAATTTG TTTTGCAAAG TGTCCGCGCC AGGAGCTGCT GTACTCCTGA ACCATGACCC 
TCCTCTCCCT TCCTCCTCAG GACATGGAGA TCTGTGCAGA TGAGCTCAAG AAGGTCCTTA 



ACACAGTCGT 
CCCGTTGTCT 
ACACGGGTTC 
GCCGCCCCTT 
CCAGTCAGGC 
ACATCTTGGA 
TAACTGCCCT 
CTTAGGAGAG 
AGGGGAGGGT 
CTCCCTCCAA 
GACAGATGGC 
GGCCTGGCAG 
ATTCAGTGTG 
AC AC AG AC C A 
GTGCTGAGAA 
GGGGACTAGC 
TCCCAGGTCA 
CTCCACGTCC 
TGACATCATT 
CTGCTGCTTC 
CTCTCTTGCA 
ATCCAGGCTG 
GTTACTGGTG 
GGTCCTCTGA 
CTGTGCATT 
AGCTCAAGfGT 
AAGGTGGAGG 
GGTGBAGGGA 
TGOACTTGAG 
CCTTCTTAGG 
yCTGGGGGTCT 



GAACAAACGT 
CAAAGCAGCT 
ACACTGGAGT 
CCCGACCCTC 
AAAGGGCCCT 
TACTCGTCTG 
AACCCCTGTG 
CGGCTCCTGG 
TAAATAGTAC 
ATCCAGGGGG 
TCTGGAAAGC 
GTGGGAAGAG 
TGACCTCCAT 
GTCCGGCACC 
GGAAGGGGTG 
TACTAGGGCC 
CAGAGTGTCC 
ACCTCTAACA 

accatgcgg 
gttaggc/gg 
ggggc/gttg 
aac^agggcc 
attctctgcc 

1GGGAAGTTA 
TTTCACAGGA 
TCTGGAGGTA 
GGTCAACGGG 
AG G GAT AG G A 
GCCCAAAAGG 
GAAGATCTAG 
GACCTGCTGG 



GAGTTGCTCA 
CCTCACTCTT 
CCTGCCGTAG 
TGTCATCAGC 
AATTTGTGCC 
AAAGGGGTTG 
CTTCTCTCAG 
GTTACAGAGT 
AACAGGGCAG 
ATTTTGCTGT 
TCAACCTGCA 
AAAATGAAGC 
CCTCAAATTT 
ATCAACAG 
TCAGGGM'GT 
CC AC TAG AG A 
GAGfAGGCAGG 
^GTCCCCTC 
ACGCAGACAA 
AGGGCATGTT 
TGGCAACAGG 
AATGACCTCT 
TGCACATCTT 
C AG TAG TAG A 
GCTTCTCATG 
AAGCATAGGC 
GCGGACTGGA 
ACAGAACATG 
ACCTCTGCTC 
GAGAAAGGAA 
GACTGTTCCC 



AACCAAATGG 
CTCCATCCCC 
CATGATTGCG 
CCACGGGGGC 
CAGGGAAACT 
TTAGAGGCGG 
GCCTGGGATC 
AGGCGCAATC 
TGGGTAGG 
GTGCTGTGTA 
GGAGTTCCAC 
GTGjGGAGTCA 
CTATTGCCA 
ACGAGATGCG 
GGACCCGAGA 
AGGAGAGGGA 
GAAAATAGAA 
CACAGGATTC 
ACACATGAAC 
CAGTAAGTGG 
CATCTCACCT 
TTAGGCCCAG 
TGTGCTGATG 
GGCGGAGTGC 
CATTTGACAA 
ACAGCACATT 
CCCAGGGTGT 
GAGGGAGGCT 
CCCCAGTCAC 
ACAGTAAGCC 
TTTCCTCTTG 



GGGTGGGGTG 
CCAGACAAGG 
CTCATGGATG 
CAAGGCAACA 
TAAGGAGACC 
AAGGGQAGGA 
CTGCCCAAGC 
ITGACTGGT 
AGCCCGGAGT 
GCCCTGACCT 
CACCTCTGGA 
AGAATGGGGT 
GAAAATTTTC 
AAATGCAGTC 
CGGTGGGAGC 
AAGGGCTTCT 
GACAGGCCCA 
CACCTCAACA 
ATCGACTTTG 
GAGAGGGGGG 
GATAATCTCC 
AATGGGATGG 
AGGGACAGCA 
GCCTGTAACT 
G GAT G GAG AT 
CCCCCTACAC 
GCTCCTCATT 
CAGCAGGCTC 
TTGATGCGGG 
ACTGCTTCTT 
CCCCGTAAGA 



GGTGGGGAGT 
ACCTG^AGAC 
TA^CCTTCCT 
'ACAGGGTGC 
CTGATTCAGA 
TGTTGGGTTG 
AAAAGTGGTC 
GGTGGAGTGG 
CTCCTAGACC 
CCCTCCTCCA 
ACAAGATTAA 
TGATTTGGAG 
AAACACTATG 
AACGACGCAG 
AGGAATGGGA 
CACTTTCCCT 
AGGCCTCCAG 
ACCAGCTCTA 
ACAGTTTCAT 
CTGCCCTCTG 
AGTCTGCTCC 
CAAAGGGAGG 
CTGGGCACAC 
GGCCTCTGGC 
GGTATCATCA 
ATTAAAACTC 
TCCACACAGT 
CCAGGACACA 
AAAACATGCA 
GGAAAATCTT 
TTCCTAGGGC 



GGGGGGGGGG 
GGCTGCAGCT 
ACTCAGGATT 
CTACACCCCT 
TACCCATCCT 
AACAAACCCT 
CGGTTCTGAA 
GGTGGCACTC 
GGGCCAGGAA 
TAGGGCTGGT 
ATCTAAATAA 
ATCAGCCATG 
CAATGAAAAA 
CTTGAGCATA 
TGTTGCTGTA 
CTGAATAATG 
CATGACCGTG 
GCCTGCGGCA 
GCCAACGTGG 
TTAGTAAAAG 
AAGTCCACAG 
GGTGTCATCT 
GTGCTGGAGC 
TTTCCTCCTT 
AATCTGGG. 



GGGGGTCACT 
CACCATGTAT 
TCAGTTTCAC 
ACAGGCTTCC 
TGATCGGTCA 
TGTCCCTTTG 
GCGAGTGCTC 
AGCACCTCCT 
CCAAACCAGC 
TACTTTGGGC 
AGGCATGTGT 
CATGACTGAA 
CACACACAAA 
AGAATGGCTC 
AAAGACATCA 
GAGTGGAGAT 
GACAAAACCC 
TATCTATAGG 
AAAGGGCTG 
TACTTCATTT 
CTTTAmCCA 

gat to cat tc 
ag/tctaggg 
(accccctcc 
atgtagtcac 



cttttctgat 
gcctgaacca 
cctctatttc 
aggcacctca 
tgcctagcct 
ccatgtggag 
ctgcttacct 
tgtgctagag 
actgggttct 
tgtccaactc 
atggctggtc 
tggcttccaa 
aacaaaatct 
agatactttc 
agaata^atg 
tgagotatcc 
tg^acgcagc 
3vtcctgtgt 
ccgtgaatat 
tcctcttgta 
aaatgtaaga 
ttctaatcca 
cat at att tc 

TTTCCCAATT 
CAGG 



CTACATTCTG 
GGCTGGCCTC 
CAAAGCCATT 
TCAGTCATGT 
GACCCTTTAG 
GAAAGTGCCT 
TGCTCTAGGC 
CCCTCCATCA 
ACTGCTGTQTC 

ataagt/tgg 

CCCOTGTGTT 
TOATATACTC 
'GAATTTTGT 
CAAGACATAA 
GGGTCATGTA 
TAGCTCCTCT 
TGTTTGTTTG 
TTTCCACCCA 
GCAGACAAGG 
TTTGCTTCAT 
AGGCTATTTG 
TATTCAATAT 
TCTTAAATAG 
TATTTGGGTC 



ATCTTGGGAC 
ATCCAAAGCC 
TACCTCAAAG 
TCCTCCTCCA 
TAAAGCA&tfG 
GCCTC^GGTC 
TG7CTGCAGA 
CTTCACGCT 
GGTAAACTAA 
CTGCATTTTG 
TTGTTGTCTC 
ACCTATCACC 
AATCATGCCT 
AAGGAAGGCA 
CAACGGGAGG 
GCTCACTAAC 
CTAAACTTCT 
GTTTCCTTCT 
TAACGAAAGT 
TCTTGCTTCA 
CTTATAAACA 
TAAAAAATCA 
GAGAAAGATT 
ACTACCTTGA 



TTCTTTCA£ 
ATGCAJZGATC 
GAGCCAGCAG 
TTTACCCCC 
AGGTAGGAAG 
CGAGCCGCCT 
AGCACCTGCC 
GTCCCACCAT 
CTCAGTGGAA 
AAAAAAGCTG 
ACATTTAGAT 
TACAAGAGAA 
ATTGCTATTT 
GAGGAATAGT 
GGCCGGTTAC 
TGACCTGTCG 
CTGGACCATG 
TCCTCGCTAA 
AAACCGTCAA 
CAAAGTTACG 
TTTTGAGTCA 
GAAACCAAGG 
TTCAACAGCT 
ATTTAGAGTG 



(2) INFORMATION FOR SEQ ID NO: 5: 

/I) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5149 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: genomic DNA 



(ix) FEATURE: 

(A) NAME/KEY: misc_f eature 

(D) OTHER INFORMATION: /label= FIGURE 2 

(ix) FEATURE: 

(A) NAME/KEY: misc_f eature 

(B) POSITION : 1303.. 3764 

(D) OTHER INFORMATION: /note=CDS 

(ix) FEATURE: 

(A) NAME /KEY : misc_feature 

(B) POSITION : 1631 

(D) OTHER INFORMATION: /note= CGA->TGA 
LGMD2A family 



;L1 mutation in one 



(ix) FEATURE: 

(A) NAME /KEY : misc_feature 

(B) POSITION : 1848 
(D) OTHER INFORMATION: /note= CTG-^ 

LGMD2A family 



nCLl mutation in one 



(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) POSITION : 1853 

(D) OTHER INFORMATION: /noj 
LGMD2A family 



CAA->CA ; nCLl mutation in one 



(ix) FEATURE: 

(A) NAME /KEY : misc_f e^ature 

(B) POSITION : 2004 

(D) OTHER INFORMATION: /note= GGG->GAG 
LGMD2A family 

(ix) FEATURE: 

(A) NAME /KEY : r/isc_f eature 

(B) POSITION / 2248 

(D) OTHER INFORMATION : / note= CGG->CG 
LGMD2A family 

(ix) FEATURE: 

(A) NAME/KEY: misc_f eature 

(B) POSITION : 2364 

(D) Ot/eR INFORMATION: /note= GTG->GGG 
LGMD2A family 

(ix) FEATURE: 

(A/ NAME /KEY : misc_f eature 
I) POSITION : 2382 
) OTHER INFORMATION: /note= TGG->TAG 

LGMD2A fafinily 

(ix/ FEATURE: 

(A) NAME/KEY: misc_f eature 

(B) POSITION : 2771 

(D) OTHER INFORMATION: /note= CGG->TGG 
LGM$2A family 

(ix) FEATURE: 

(A) NAME /KEY : misc_f eature 

(B) POSITION : 3018 



nCLl mutation in one 



nCLl mutation in one 



nCLl mutation in one 



nCLl mutation in one 



nCLl mutation in one 



LGMD2A 
(ix) 



one 
LGMD2A 



(D) OTHER INFORMATION : /note- GGG->CAG ; nCLl mutation inySne 
family 



(ix) 

LGMD2A 
(ix) 

LGMD2A 
(ix) 



FEATURE : 

(A) NAME /KEY : misc_feature 

(B) POSITION : 3372.-3373 

(D) OTHER INFORMATION : / note- Deletion AC 
family 

FEATURE : 

(A) NAME/KEY: misc_feature 

(B) POSITION : 3533 

(D) OTHER INFORMATION: /note= AGC->GGC 
family 

FEATURE : 

(A) NAME /KEY : misc_feature 

(B) POSITION : 3609 

(D) OTHER INFORMATION: /not e= CG^->CAG 
family 



nCLl mutation in 



'nCLl mutation in one 



nCLl mutation in one 



one 
LGMD2A 

(ix) 



FEATURE : 

(A) NAME /KEY : misc_feature 

(B) POSITION : 3616.. 3619/ 
(D) OTHER INFORMATION:/^ 

family 



i= Deletion AGAC ; nCLl mutation in 



FEATURE : 

(A) NAME /KEY : misc /feature 

(B) POSITION : 3666.-3666 
(D) OTHER INFORMATION: /note= AG->TCATCT 

LGMD2A family 



nCLl mutation in one 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 5: 
ATATCAGTTA GCCTGGt/tC ACTATACAGT ACATCATTTT GCTTAAAGTC ACAGCTTACG 
AGAACCTATC GATGa/gTTA AGTGAGGATT TTCTCTGCTC AGGTGCACTT TTTTTTTTTT 
TTAAGACGGA GTc/cTTTCT GTCACCTGGG CTGGAGTGCA GTGGCGTGAT CTGGGTTCAC 
TACAACCTCT c/cTCCTGGG TTCAAGCAAT TCTTCTGTCT CAGCCTCCCA AGTAGCTGGG 
attacaggca/ccccccgcca CACCCGGCTT ATTTTTGTAT TTTTAGTAGA GACAGGGTTT 
cactattg/t GTCCATGCTG GTCTCGAACT CGTGACCTCA TGTGATCCAC CCGCCTCGGC 
CTCCCA^GT GCAGAGATTA GAGACGTGAT CCACATGGCC CAGCAGGACC ACTTTTTAGC 
AGATTj^AGTC CCAGTGTTCA TTTTGTGGAT GGGGAGAGAC AAGAGGTGGC AAGGTCAAGT 
GTGC^GGTAG AGACAGGGAT TTTCTCAAAT GAGGACTCTG CTGAGTAGCA TTTTCCATGC 
^CATTTCC AATGAGCGCT GACCCAAGAA CATTCTAAAA AAGATACCAA ATCTAACATT 
VTAATGTT CTGATATCCT AAAATTTTAG GACTAAAAAT CATGTTCTCT AAAATTCACA 



GAATATTTTT 
TTTTCCATTC 
TGTCAGCAGT 
TCTGGCAACA 
ATGCAGTCTC 
GGCTCAAGCA 
CATGCATGCT 
CATTTCTCAG 
CTGTCCGACC 
TGACAGAATT 
CTTATTTTCT 
GCATCTGTGG 
GCCCAGAGCA 
ATCAGCCGCA 
AAATGTCTAG 
CTCTTTTATA 
GAGAATCCCC 
GGGGACTGCT 
C GAG T CAT AC 

ttctggcgct 
aatcaactgg 
aaggcttatg 
gccatggagg 
gacatgta/a 

GATGATQ5GCA 

attgoacgga 
agaggctcag 
g/ctgcgggc 
/aaaggtgaga 

GGTTCTTGGA 



GTAGAATTCA 
ATTTGATGGC 
TCTCAGCTTC 
CCAGCACTTC 
TCTTGCTCTC 
TCTTCAGGCC 
GCTGGTAGGA 
GAGAACTTAT 
TTCTGATGGG 
ACTCCAACTT 
TTAAAAAGCT 
CTCCAAGGAC 
AGGCCACTGA 
ATTTTCCTAT 
AAAAGAAAGT 
GCCAGAAGTT 
GATTTATCAT 
GGTTTCTC 
CCCATGATCA 
ATGGAGAGTG 
TT'LTCACCAA 
'AAGCTCCA 
ACTTCACAGG 
AGATCATGAA 
CGAACATGAC 
TGGTAAGGAA 
ATGAAAGACC 
TGGTCAGAGG 
AAGTGAAGCT 
GTGATAGATG 



GTACCTCCCG 
CAGTAGTTGG 
TTTCCAGTGT 
ATGGCAACAG 
ATACTCACAG 
ACTGAAACAC 
GACCCCCAAG 
GGCTTCAGAA 
CTTTCAACTT 
CCCCTTTGCA 
TTTTCTTCCA 
AGCGGCTGAG 
GGCTGGGG 
TATCGG^GTG 
TCTT^ATGTG 
CCjz!cATCCAG 
'GATGGAGCC 
AGCCATTGCC 
AAGTTTCATC 
GGTGGACGTG 
GTCCAACCAC 
TGGTTCCTAC 
AGGGGTGGCA 
GAAAGCCATC 
CTATGGAACC 
TATGGATAAC 
GACCCGGACA 
TCACGCCTAC 
GGTGCGGCTG 
GAAGGACTGG 



TTCACCCTAA 
GTGGTCTGTA 
TCACCTTACT 
AAATGTCCCT 
TGTTTCTTCA 
AACCCTCACT 
TCAACATTGC 
TCACAGCTC 
TGAACTQSAT 
GTTGOTTCCT 
AACZCCACTTG 
ICCCGGTCCC 
GGAAACCCAA 
AAAGAGAAGA 
GACCCTGAGT 
TTCGTCTGGA 
AACAGAACTG 
TGCCTGACCC 
GAAAACTACG 
GTTATAGATG 
CGCAATGAGT 
GAAGCTCTGA 
GAGTTTTTTG 
GAGAGAGGCT 
TCTCCTTCTG 
TCACTGCTCC 
ATCATTCCGG 
TCTGTCACGG 
CGGAATCCGT 
AGCTTTGTGG 



CTAGCTTTTT 

taactgccta 
cagatactcc 
agccaggtt 
catctatxttt 
ctct/tctct 
t^agaaatc 
gtttttaaga 
gtggacactt 
ttccttgaag 
ccatgccgac 
cagggccagt 

GTGGCATCTA 
CATTCGAGCA 
TCCCACCGGA 
AGAGACCTCC 
ACATCTGTCA 
TGAACCAGCA 
CAGGGATCTT 
ACTGCCTGCC 
TCTGGAGTGC 
AAGGTGGGAA 
AGATCAGGGA 
CCCTCATGGG 
GTCTGAACAT 
AGGACTCAGA 
TTCAGTATGA 
GGCTGGATGA 
GGGGCCAGGT 
ACAAAGATGA 



TGCAATAT^G 
CTCA^TAACA 
C7TTTCATTT 
TCTCTCTACC 
TAGTTTTCCT 
CTCCCTCTGG 
CTTTAGCACT 
TGGACATAAC 
TTCTCTCAGA 
GTAGCTGTAT 
CGTCATTAGC 
TCCTCACCCG 
TTCAGCCATC 
ACTTCACAAG 
TGAGACCTCT 
GGAAATTTGC 
AGGAGAGCTA 
CCTTCTTTTC 
CCACTTCCAG 
AACGTACAAC 
TCTGCTGGAG 
CACCACAGAG 
TGCTCCTAGT 
CTGCTCCATT 
GGGGGAGTTG 
CCTCGACCCC 
GACAAGAATG 
GGTCCCGTTC 
GGAGTGGAAC 
GAAGGCCCGT 



CTGCAGCACC 
CATTTCACAA 
CAGACCTGGA 
TGCCGCAACT 
GAGGACGATG 
AAGAACCGGC 
TACGAGGTTC 
TACAACGCCT 
TTCCGCCTGC 
GGGGAATTCA 
ACCATCTCCG 
GACAGAGCAA 
AAAACAAGCC 
GAAAGTGAGG 
GAGATCTGTG 
CTGAAGACAC 
GATGGCTCTG 
TGGCAGAAAA 
ATGCGAAATG 
ACCATGCGGT 
GTTAGGCTGG 
ATCAAGCTCA 
TCATCCAAAG 
TTTACCTC 
GTTCCTCC^C 
AGTAAAGTCAA 
CTGCOTCTGG 
GCTC6TCTGCA 

ca/ccttcacg 
/gggtaaact 
'ggctgcattt 



AGGTCACTGA 
AGTTGGAGAT 
CAGTGTCTGT 
TCCCAGATAC 
ACCCTGATGA 
GGAAGGACCG 
CCAAAGAGAT 
CCAAGGCCAG 
CTCCCAGCGA 
TCCTCCGGGT 
TGGATCGGCC 
ACAGCAACAA 
CTGATAAGCA 
AACAGCAACA 
CAGATGAGCT 
ACGGGTTCAC 
GAAAGCTCAA 
TTTTCAAACA 
CAGTCAAC 
ACGCAG#CAA 
AGGGQATGTT 

acgttctgga 
:atgcagga 
aggacccagc 
cattttaccc 
tgaggtagga 
tccgagccgc 
gaagcacctg 
ctgtcccacc 
aactcagtgg 
tgaaaaaagc 



GGATGGAGAG 
CTGCAACCTC 
GAACGAGGGC 
TTTCTGGACC 
CTCGGAGGTG 
GAAGCTAGGG 
GCACGGGAAC 
GAGCAAAACC 
GTACGTCATC 
CTTCTCTGAA 
AGTGAAAAAG 
GGAGCTGGGT 
AAAGCAGTCC 
ATTCCGG 
CAAGAAQfeTC 
ACTG^AGTCC 
CCJTGCAGGAG 
: TAT G AC AC A 
CGCAGGATTC 
ACACATGAAC 
CAGAGCTTTT 
GTGGCTGCAG 
TCACTCAGGA 
AGCTACACCC 
CCTACCCATC 
AGAACAAACC 
CTCGGTTCTG 
CCGGTGGCAC 
ATGGGCCAGG 
AATAGGGCTG 
TGATCTAAAT 



TTCTGGATGT 
ACGGCCGATG 
CGCTGGGTAC 
AACCCTCAGT 
ATTTGCAGCT 
GCCAGTCTCT 
AAGCAGCACC 
TACATCAACA 
GTGCCCTCC 
AAGAGG^ACC 
,CCA 

gtc/gaccagg 
^acagccac 

ATTTTCAAGC 
CTTAACACAG 
TGCCGTAGCA 
TTCCACCACC 
GACCAGTCCG 
CACCTCAACA 
ATCGACTTTG 
CATGCATTTG 
CTCACCATGT 
TTTCAGTTTC 
CTACAGGCTT 
CTTGATCGGT 
CTTGTCCCTT 
AAGCGAGTGC 
TCAGCACCTC 
AACCAAACCA 
GTTACTTTGG 
AAAGGCATGT 



CCTATGAGGA 
CTCTGCAGTC 
GGGGTTGCTC 
ACCGTCTGAA 
TCCTGGTQGC 
TCACCflfTTGG 
TGC^GAAGGA 
1CGGGAGGT 
CCTACGAGCC 
TCTCTGAGGA 
AG CC CAT CAT 
AGTCAGAGGA 
AGCCTGGCAG 
AGATAGCAGG 
TCGTGAACAA 
TGATTGCGCT 
TCTGGAACAA 
GCACCATCAA 
ACCAGCTCTA 
ACAGTTTCAT 
ACAAGGATGG 
ATGCCTGAAC 
ACCCTCTATT 
CCAGGCACCT 
CATGCCTAGC 
TGCCATGTGG 
TCCTGCTTAC 
CTTGTGCTAG 
GCACTGGGTT 
GCTGTCCAAC 
GTATGGCTGG 



TTTCATQTAC 
TGAC7&GCTT 
TGCCGGAGGC 
6CTCCTGGAG 
CCTGATGCAG 
CTTCGCCATC 
CTTCTTCCTG 
GTCCCAGCGC 
CCACCAGGAG 
AGTTGAAAAT 
CTTCGTTTCG 
GGGCAAAGGC 
CTCTGATCAG 
AGATGACATG 
ACACAAGGAC 
CAT G GAT AC A 
GATTAAGGCC 
CAGCTACGAG 
TGACATCATT 
CTGCTGCTTC 
AGATGGTATC 
CAGGCTGGCC 
TCCAAAGCCA 
CATCAGTCAT 
CTGACCCTTT 
AGGAAAGTGC 
CTTGCTCTAG 
AGCCCTCCAT 
CTACTGCTGT 
TCATAAGTTT 
TCCCCTTGTG 



TTTTGTTGTC 

TCACCTATCA 

GTAATCATGC 

AAAAGGAAGG 

ACAACGGGAG 

TGCTCACTAA 

GCTAAACTTC 

AGTTTCCTTC 

GTAACGAAAG 

TATCTTGCTT 

TGCTTATAAA 

AT AT TAN AAA 

AATAGGAGAA 

GGGTCACTAC 



TCACATTTAG 
CCTACAAGAG 
CTATTGCTAT 
CAGAGGAATA 
GGGCCGGTTA 
CTGACCTGTC 
TCTGGACCAT 
TTCCTCGCTA 
TAAACCGTCA 
CACAAAGTTA 
CATTTTTGCA 
AATCAGAAAC 
AGATTTTCAA 
CTTGAATTTA 



ATATCAGCCA 

AACAATGAAA 

TTCTTGAGCA 

GTTGTTGCTG 

CCTGAATAAT 

GCATGACCGT 

GGCCTGCGGC 

AGCCAACGTG 

ATTAGTAAAA 

CGAAGTTCAC 

GTCAGGTGTC 

CAAGGGTGCT 

CAGCTTTTCC 

GAGTGAATCT 



TGCATGACTG 
AACACACACA 
TAAGAATGGC 
TAAAAGACAT 
GGAGTGGAGA 
GGACAAAACC 
ATATCTATAG 
GAAAGGGCTG 
•GTACTTCAT 
AGCTTTATAC 
ATCTGOTTTC 
GGAj^CAGCTC 
T/ECTTGACCC 
GGGAAATGTA 



AATGGCTTCC 
AAAACAAAAT 
TCAGATACTT 
CAAGAATAAA 
TTGAGCTA^C 
CTGAAQGCAG 
GCA^CCTGTG 
ICGTGAATA 
TTCCTCTTGT 
CAAAATGTAA 
ATTCTTCTAA 
TAGGGCATAT 
CCTCCTTTCC 
GTCACCAGG 



AATCATATAC 
CTTG^ATTTT 
TC^AAGACAT 
'GGGTCATGT 
CTAGCTCCTC 
CTGTTTGTTT 
TTTTCCACCC 
TGCAGACAAG 
ATTTGCTTCA 
GAAGGCTATT 
TCCATATTCA 
ATTTCTCTTA 
CAATTTATTT 



(2) INFORMATION FOR SEQ ID NO* 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 821 yfemino acids 

(B) TYPE: amine/ acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY/ linear 

(ii) MOLECULE TYBE: protein 

(ix) FEATURE: 

(A) NAMIyy KEY : misc-feature 
(D) OT^ER INFORMATION: /label= "FIGURE 2" 

(ix) FEATURE 

(A) MAME/KEY: Region 

(B) /LOCATION: 1 . . 61 

(D/ OTHER INFORMATION: /label= ALU 
/note= "FIGURE 3 - HUMAN" 

(ix) FEATURE: 

'{A) NAME /KEY : Region 
(B) LOCATION: 268. .309 
(D) OTHER INFORMATION: /label= ALU 
/note- "FIGURE 3 - HUMAN" 

(fa) FEATURE: 

(A) NAME/KEY: Region 

(B) LOCATION:310. .328 
(D) OTHER INFORMATION: /label= ALU 

/note= "FIGURE 3 - HUMAN" 



(ix) FEATURE: 

(A) NAME/KEY: Region 

(B) LOCATION : 57 8 . .618 

(D) OTHER INFORMATION: /label= ALU 
/note= "FIGURE 3 - HUMAN" 

(ix) FEATURE: 

(A) NAME/KEY: Region 

(B) LOCATION: 619. . 653 

(D) OTHER INFORMATION: /label= ALU 
/note= "FIGURE 3 - HUMAN" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Met Pro Thr Val lie Ser Ala Ser Val AL^ Pro Arg Thr Ala Ala Glu 
1 5 l/j 

Pro Arg Ser Pro Gly Pro Val Pro Ha/ Pro Ala Gin Ser Lys Ala Thr 
20 2/ 30 

Glu Ala Gly Gly Gly Asn Pro Ser/ciy lie Tyr Ser Ala lie lie Ser 
35 4n/ 45 

Arg Asn Phe Pro lie lie Gly/al Lys Glu Lys Thr Phe Glu Gin Leu 
50 55/ 60 

His Lys Lys Cys Leu Glu /ys Lys Val Leu Tyr Val Asp Pro Glu Phe 



75 



65 70 

Pro Pro Asp Glu Thr s/r Leu Phe Tyr Ser Gin Lys Phe Pro lie Gin 



85 



90 



Phe Val Trp Lys Ar£ Pro Pro Glu He Cys Glu Asn Pro Arg Phe He 
100 / 105 110 

He Asp Gly Ala/Asn Arg Thr Asp He Cys Gin Gly Glu Leu Gly Asp 
115 / 120 125 

Cvs Trp Phe /eu Ala Ala He Ala Cys Leu Thr Leu Asn Gin His Leu 
130 / 135 140 



Leu Phe A/g Val He Pro His Asp Gin Ser Phe He Glu Asn Tyr Ala 
145 / 150 155 160 



Glv He/ Phe His Phe Gin Phe Trp Arg Tyr Gly Glu Trp Val Asp Val 
Y 7 165 170 "5 

Val /le Asp Asp Cys Leu Pro Thr Tyr Asn Asn Gin Leu Val Phe Thr 
J 180 185 190 

Lvi Ser Asn His Arg Asn Glu Phe Trp Ser Ala Leu Leu Glu Lys Ala 
195 200 205 

Tyr Ala Lys Leu His Gly Ser Tyr Glu Ala Leu Lys Gly Gly Asn Thr 
210 215 220 

Thr Glu Ala. Met Glu Asp Phe Thr Gly Gly Val Ala Glu Phe Phe Glu 
225 230 235 240 



lie Arg 

Glu Arg 

Thr Tyr 

Arg Met 
290 

Asp Pro 
305 

Gin Tyr 

Ser Val 

Leu Val 

Trp Ser 
370 

Ala Arg 
385 

Tyr Glu 
Thr Ala 
Val Asn 



Asn Phe 
450 

Leu Glu 
465 

Leu Va; 



Ala /Ser 



it His 



Asp Ala Pro Ser Asp Met Tyr Lys He Met 
245 250 



Gly Ser Leu Met Gly Cys Ser He Asp Asp 
260 265 

Gly Thr Ser Pro Ser Gly Leu Asn Met Gly 
275 280 

Val Arq Asn Met Asp Asn Ser Leu Leu Gin, 
295 30/ 

Arg Gly Ser Asp Glu Arg Pro Thr Arg/Thr 
310 3" 

Glu Thr Arg Met Ala Cys Gly Leu/foal Arg 
325 33, 



Lys Lys Ala/Tie 
255 

Gly Thr yAsn Met 
27( 



Glu /Leu He Ala 
21 



Ala Ser 
530 

Gin Arg 
545 



Thr Gly 
340 

Arg Leu 
355 

Asp Arg 

,Leu Gin 

Asp Phe 

Asp Ala 
420 

Glu Gly 
435 

Pro Asp 

Lu Asp 

Ala Leu 

Leu Phe 
500 

Gly Asn 
515 

Lys Ala 
Phe Arg 



Leu Asp Glu Val Pro /Phe Lys Gly 
34 

Arg Asn Pro Trp J5ly Gin Val Glu 
360 

Trp Lys Asp yrp Ser Phe Val Asp 
375/ 380 

His Gin Val Thr Glu Asp Gly Glu 
390 / 395 

He TVr His Phe Thr Lys Leu Glu 
405 / 410 

Le/ Gin Ser Asp Lys Leu Gin Thr 
425 

'Arg Trp Val Arg Gly Cys Ser Ala 
440 



\sp Ser Asp Leu 



He He Pro Val 
320 

Gly His Ala Tyr 
335 

Glu Lys Val Lys 
350 

Trp Asn Gly Ser 
365 

Lys Asp Glu Lys 



Phe Trp Met Ser 
400 

He Cys Asn Leu 
415 

Trp Thr Val Ser 
430 

Gly Gly Cys Arg 
445 



Thr Phe Trp Thr Asn Pro Gin Tyr Arg Leu Lys Leu 
455 460 



Asp Asp Pro Asp Asp Ser Glu Val 
470 475 

Met Gin Lys Asn Arg Arg Lys Asp 
485 490 

Thr He Gly Phe Ala He Tyr Glu 
505 

Lys Gin His Leu Gin Lys Asp Phe 
520 

Arg Ser Lys Thr Tyr He Asn Met 

535 540 

Leu Pro Pro Ser Glu Tyr Val He 
550 555 



He Cys Ser Phe 
480 

Arg Lys Leu Gly 
495 

Val Pro Lys Glu 
510 

Phe Leu Tyr Asn 
525 

Arg Glu Val Ser 



Val Pro Ser Thr 
560 



Tyr Glu Pro His Gin Glu Gly Glu Phe He Leu Arg Val Phe Ser Glu 



565 



570 



575 



Lys Arg Asn Leu Ser Glu Glu Val Glu Asn Thr lie Ser Val Asp/Arg 
Y 580 585 590 / 

Pro Val Lys Lys Lys Lys Thr Lys Pro lie lie Phe Val Ser/Asp Arg 
595 600 605 ' 

Ala Asn Ser Asn Lys Glu Leu Gly Val Asp Gin Glu Ser/lu Glu Gly 
610 615 620 

Lys Gly Lys Thr Ser Pro Asp Lys Gin Lys Gin SerAo Gin Pro Gin 
625 630 635 / 64U 

Pro Gly Ser Ser Asp Gin Glu Ser Glu Glu Gin /in Gin Phe Arg Asn 
6 45 650 / 655 

He Phe Lys Gin lie Ala Gly Asp Asp Met /lu lie Cys Ala Asp Glu 
660 665 / 670 

Leu Lys Lys Val Leu Asn Thr Val Val A^n Lys His Lys Asp Leu Lys 
675 680 / 685 

Thr His Gly Phe Thr Leu Glu Ser Cys" Arg Ser Met He Ala Leu Met 
690 695 / ™0 

Asp Thr Asp Gly Ser Gly Lys Le/ Asn Leu Gin Glu Phe His His Leu 
705 710 / "715 720 

Trp Asn Lys lie Lys Ala Trp/Gln Lys lie Phe Lys His Tyr Asp Thr 

725 / 730 ' ^ 

Asp Gin Ser Gly Thr lie /sn Ser Tyr Glu Met Arg Asn Ala Val Asn 
v 740 / 745 750 

Asp Ala Gly Phe His l4 Asn Asn Gin Leu Tyr Asp lie lie Thr Met 
755 / 760 765 

Arg Tyr Ala Asp Ly^ His Met Asn lie Asp Phe Asp Ser Phe He Cys 
770 / 775 780 

Cys Phe Val Ar^Leu Glu Gly Met Phe Arg Ala Phe His Ala Phe Asp 
785 / 790 795 800 

Lys Asp Gly/sp Gly lie lie Lys Leu Asn Val Leu Glu Trp Leu Gin 
805 810 °lb 

Leu Thr ltyet Tyr Ala 
820 



(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 821 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(/i) MOLECULE TYPE: protein 



(ix) FEATURE: 

(A) NAME/KEY: Region 

(B) LOCATION: 1. .61 

(D) OTHER INFORMATION: /label= ALU 

(ix) FEATURE: 

(A) NAME/KEY: Region 

(B) LOCATION: 268. .309 
(D) OTHER INFORMATION: /label= ALU 

/note= "FIGURE 3 - RAT" 

(ix) FEATURE: 

(A) NAME/KEY: Region 

(B) LOCATION: 310. .328 
(D) OTHER INFORMATION: /label= ALU 

/note= "FIGURE 3 - RAT n v 

(ix) FEATURE: 

(A) NAME /KEY : Region 

(B) LOCATION: 57 8. .618 
(D) OTHER INFORMATION: /labe/= ALU 

/note- "FIGURE 3 - /AT" 

(ix) FEATURE: 

(A) NAME/KEY: Region 

(B) LOCATION : 619 . . 653 , 
(D) OTHER INFORMATION^ /label= ALU 

/note= "FIGURE 3 - RAT" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

Met Pro Thr Val lie /er Pro Thr Val Ala Pro Arg Thr Gly Ala Glu 
1 5 / 10 1 

Pro Arg Ser Pro c/y Pro Val Pro His Pro Ala Gin Gly Lys Thr Thr 
20 / 25 JU 

Glu Ala Gly Gl/ Gly His Pro Gly Gly lie Tyr Ser Ala lie lie Ser 
35 7 40 45 

Arc Asn Phe /ro lie lie Gly Val Lys Glu Lys Thr Phe Glu Gin Leu 
50 / 55 60 

His Lys Js Cys Leu Glu Lys Lys Val Leu Tyr Leu Asp Pro Glu Phe 
65 / 70 75 ™ 

Pro Prc/Asp Glu Thr Ser Leu Phe Tyr Ser Gin Lys Phe Pro lie Gin 
85 90 yb 

Phe v£l Trp Lys Arg Pro Pro Glu lie Cys Glu Asn Pro Arg Phe He 
/ 100 105 110 

Il/ciy Gly Ala Asn Arg Thr Asp lie Cys Gin Gly Asp Leu Gly Asp 
J 115 120 125 

Ls Trp Leu Leu Ala Ala He Ala Cys Leu Thr Leu Asn Glu Arg Leu 
130 135 l^O 

Leu Phe Arg Val He Pro His Asp Gin Ser Phe Thr Glu Asn Tyr Ala 
145 150 155 l foU 



Gly He Phe His Phe Gin 
165 

Val He Asp Asp Cys Leu 
180 

Lys Ser Asn His Arg Asn 
195 

Tyr Ala Lys Leu His Gly 
210 

Thr Glu Ala Met Glu Asp 
225 230 

He Lys Asp Ala Pro Ser 
245 

Glu Arg Gly Ser Leu Met 
260 

Thr Tyr Gly Thr Ser Pro 
275 

Arg Met Val Arg Asn Met 
290 

Asp Pro Arg Ala Ser Asp 
305 310 

Gin Tyr Glu Thr Arg Met 
325 

Ser Val Thr Gly Leu Gil 
340 



Phe Trp Arg Tyr Gly Asp Trp Val Asp Va: 
170 175 

Pro Thr Tyr Asn Asn Gin Leu Val Phe/Thr 
185 190 

Glu Phe Trp Ser Ala Leu Leu Glu/Lys Ala 
200 205 

Ser Tyr Glu Ala Leu Lys Gly/Gly Asn Thr 
215 220 

Phe Thr Gly Gly Val Thr yGlu Phe Phe Glu 
235 / 240 

Asp Met Tyr Lys He Aet Arg Lys Ala He 
250 / 255 

Gly Cys Ser He /sp Asp Gly Thr Asn Met 
265 / 270 

Ser Gly Leu #sn Met Gly Glu Leu He Ala 
280 / 285 

Asp Asn sir Leu Leu Arg Asp Ser Asp Leu 
295 / 300 

Asp Arg Pro Ser Arg Thr He Val Pro Val 
315 320 

A]4 Cys Gly Leu Val Arg Gly His Ala Tyr 
330 335 

Glu Ala Leu Phe Lys Gly Glu Lys Val Lys 
345 350 



Leu Val Arg Leu Arg/Asn 
355 

Trp Ser Asp Gly /frp Lys 
370 

Ala Arg Leu Gan His Gin 
385 / 390 

Tyr Asp As£ Phe Val Tyr 
405 

Thr Ala/Asp Ala Leu Glu 
420 

Val /sn Glu Gly Arg Trp 
435 

As^i Phe Pro Asp Thr Phe 
450 

^Leu Glu Glu Asp Asp Asp 
465 470 



Pro Trp Gly Gin Val Glu Trp Asn Gly Ser 
360 365 

Asp Trp Ser Phe Val Asp Lys Asp Glu Lys 
375 380 

Val Thr Glu Asp Gly Glu Phe Trp Met Ser 
395 400 

His Phe Thr Lys Leu Glu He Cys Asn Leu 
410 415 

Ser Asp Lys Leu Gin Thr Trp Thr Val Ser 
425 430 

Val Arg Gly Cys Ser Ala Gly Gly Cys Arg 
440 445 

Trp Thr Asn Pro Gin Tyr Arg Leu Lys Leu 
455 460 

Pro Asp Asp Ser Glu Val He Cys Ser Phe 

475 480 



Leu Val Ala Leu Met Gin Lys Asn Arg Arg Lys Asp Arg Lys Leu G/y 
485 490 495 

Ala Asn Leu Phe Thr lie Gly Phe Ala lie Tyr Glu Val Pro l/s Glu 
500 505 510 

Met His Gly Asn Lys Gin His Leu Gin Lys Asp Phe Phe Le£ Tyr Asn 
515 520 525 

Ala Ser Lys Ala Arg Ser Lys Thr Tyr lie Asn Met Ar/du Val Ser 
530 535 540 

Gin Arg Phe Arg Leu Pro Pro Ser Glu Tyr Val He 
545 550 555 



'Val 



Pro Ser Thr 
560 



Tyr Glu Pro His Gin Glu Gly Glu Phe lie Leu /rg Val Phe Ser Glu 
Y 565 570 / 575 

Lys Arg Asn Leu Ser Glu Glu Ala Glu Asn jfir He Ser Val Asp Arg 



580 



585 



Pro Val Lys Lys Lys Lys Asn Lys Pro i/e He Phe Val Ser Asp Arg 
595 600 / 605 

Ala Asn Ser Asn Lys Glu Leu Gly Va/Asp Gin Glu Ala Glu Glu Gly 
610 615 / 620 

Lys Asp Lys Thr Gly Pro Asp Ly/Gln Gly Glu Ser Pro Gin Pro Arg 
625 630 / 635 MU 

Pro Gly His Thr Asp Gin Glu/Ser Glu Glu Gin Gin Gin Phe Arg Asn 



645 



650 



lie Phe Arg Gin He Ala Sly Asp Asp Met Glu He Cys Ala Asp Glu 
660 / 665 670 

Leu Lys Asn Val Leu h/n Thr Val Val Asn Lys His Lys Asp Leu Lys 
675 / 680 685 

Thr Gin Gly Phe Th/ Leu Glu Ser Cys Arg Ser Met lie Ala Leu Met 
690 / 695 700 

Asp Thr Asp Gly/er Gly Arg Leu Asn Leu Gin Glu Phe His His Leu 
705 / 710 715 720 

Trp Lys Lys He Lys Ala Trp Gin Lys He Phe Lys His Tyr Asp Thr 
725 730 

Asp His sJ: Gly Thr He Asn Ser Tyr Glu Met Arg Asn Ala Val Asn 
7 740 745 750 

Asp Ala/ciy Phe His Leu Asn Ser Gin Leu Tyr Asp He He Thr Met 
755 760 765 

Arq Tyr Ala Asp Lys His Met Asn He Asp Phe Asp Ser Phe He Cys 
/770 775 780 

Phe Val Arg Leu Glu Gly Met Phe Arg Ala Phe His Ala Phe Asp 



795 



!5 7 90 
iys Asp Gly Asp Gly lie He Lys Leu Asn Val Leu Glu Trp Leu Gin 



805 



810 



815 



Leu Thr Met Tyr Ala 
820 

(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 821 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(ix) FEATURE: 

(A) NAME/KEY: Region 

(B) LOCATION: 1. . 61 
(D) OTHER INFORMATION: /label- 

/note= "FIGURE 3 - PIC 

(ix) FEATURE: 

(A) NAME /KEY : Region 

(B) LOCATION: 2 68. .309 
(D) OTHER INFORMATION :/Label= ALU 

/note= "FIGURE/3 - PIG" 

(ix) FEATURE: 

(A) NAME /KEY : Region 

(B) LOCATION : 310 Y.32i 
(D) OTHER INFORMATION: /label= ALU 

/note= /FIGURE 3 - PIG" 

(ix) FEATURE: 

(A) NAME/KE^: Region 

(B) LOCATLON:578. . 618 
(D) OTHEIy INFORMATION : /label= ALU 

te= "FIGURE 3 -PIG" 

(ix) FEATURE 

(A) NAME/KEY: Region 

(B) /LOCATION: 619. .653 
(d/0THER INFORMATION: /label= ALI 

/note= "FIGURE 3 - PIG" 



(xi) /SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

Me* Pro Thr Val He Ser Ala Ser Val Ala Pro Arg Thr Ala Ala Glu 
5 10 15 

/pro Arg Ser Pro Gly Pro Val Pro His Pro Ala Gin Ser Lys Ala Thr 
20 25 30 

Glu Ala Gly Gly Gly Asn Pro Ser Gly He Tyr Ser Ala He He Ser 
35 40 45 

Arq Asn Phe Pro He He Gly Val Lys Glu Lys Thr Phe Glu Gin Leu 
50 55 60 



His Lys Lys Cys Leu Glu Lys Lys Val Leu Tyr Val Asp Pro Glu Phe/ 
65 70 75 y 

Pro Pro Asp Glu Thr Ser Leu Phe Tyr Ser Gin Lys Phe Pro Vein 

85 90 y 

Phe Val Trp Lys Arg Pro Pro Glu lie Cys Glu Asn Pro Arq/Phe lie 

ioo 105 

lie Asp Gly Ala Asn Arg Thr Asp lie Cys Gin Gly Glu/eu Gly Asp 
115 120 lf> 

Cvs Trp Phe Leu Ala Ala lie Ala Cys Leu Thr Leu/Asn Gin His Leu 
130 135 H» 

Leu Phe Arg Val lie Pro His Asp Gin Ser Phe/le Glu Asn Tyr Ala 

145 150 15 / 

Gly He Phe His Phe Gin Phe Trp Arg Tyr /ly Glu Trp Val Asp Val 
165 l" 70 / 

Val He Asp Asp Cys Leu Pro Thr Tyr Asn Asn Gin Leu Val Phe Thr 
180 185 / 

Lys Ser Asn His Arg Asn Glu Phe tA Ser Ala Leu Leu Glu Lys Ala 
195 200 / 205 

Tvr Ala Lys Leu His Gly Ser Ty/ Glu Ala Leu Lys Gly Gly Asn Thr 
210 215 / 220 

Thr Glu Ala Met Glu Asp Ph/rhr Gly Gly Val Ala Glu Phe Phe Glu 
225 230 / 235 

lie Arg Asp Ala Pro Ser/sp Met Tyr Lys lie Met Lys Lys Ala lie 
245 / 250 

Glu Arg Gly Ser Leu /et Gly Cys Ser lie Asp Asp Gly Thr Asn Met 
260 / 265 

Thr Tvr Gly Thr s/r Pro Ser Gly Leu Asn Met Gly Asp Leu lie Ala 
275 7 280 285 

Arq Met Val Ar/ Asn Met Glu Asn Ser Arg Leu Arg Asp Ser He Leu 
290 7 295 300 

Asp Pro Glu/val Ser Asp Asp Arg Pro Thr Arg Thr He Val Pro Val 
305 7 310 315 320 

Gin Phe Am Thr Arg Met Ala Cys Gly Leu Val Arg Gly His Ala Tyr 
r 325 330 335 

Ser v/l Thr Gly Leu Glu Glu Ala Leu Phe Lys Gly Glu Lys Val Lys 
/ 340 345 350 

LeAal Arg Leu Arg Asn Pro Trp Gly Gin Val Glu Trp Asn Gly Ser 
/ 355 360 365 

trp Ser Asp Ser Trp Lys Asp Trp Ser Phe Val Asp Lys Asp Glu Lys 
/ 370 375 380 

/ Ala Arg Leu Gin His Gin Val Thr Glu Asp Gly Glu Phe Trp Met Ser 



385 390 395 400 

Tyr Asp Asp Phe lie Tyr His Phe Thr Lys Leu Glu lie Cys Asn L<§u 



405 



Thr Ala Asp Ala Leu Glu Ser Asp Lys Leu Gin Thr Trp Thr /al Ser 
420 425 430/ 

Val Asn Glu Gly Arg Trp Val Arg Gly Cys Ser Ala Gly Thr Gly Arg 
435 440 445/ 

Asn Phe Pro Asp Thr Phe Trp Thr Asn Pro Gin Tyr #rg Leu Lys Leu 
450 455 460/ 

Leu Glu Glu Asp Asp Asp Pro Asp Asp Ser Glu Val lie Cys Ser Phe 

47 5/ 4dU 



465 



470 



Leu Val Ala Leu Met Gin Arg Asn Arg Arg I#s Asp Arg Lys Leu Gly 
485 490/ 495 

Ala Asn Leu Phe Thr lie Gly Phe Ala i/e Tyr Glu Val Pro Lys Glu 
500 505 / 510 

Met His Gly Asn Lys Gin His Leu c/n Lys Asp Phe Phe Leu Tyr Asn 
515 520 / 525 

Ala Ser Lys Ala Arg Ser Arg 1\A Tyr lie Asn Met Arg Glu Val Ser 
530 535 / 540 

Glu Arg Phe Arg Leu Pro Pr/ Ser Glu Tyr Val lie Val Pro Ser Thr 
545 550 / 555 560 

Tvr Glu Pro His Gin Glu/ciy Glu Phe Met Leu Arg Val Phe Ser Glu 
565 / 570 575 

Lys Arg Lys Leu Ser /lu Glu Val Glu Asn Thr lie Ser Val Asp Arg 
580 / 585 590 

Pro Val Arg Lys ds Lys Thr Lys Pro He He Phe Val Ser Asp Arg 
595 7 600 605 

Ala Asn Ser As/ Lys Glu Leu Gly Val Asp Gin Glu Ser Glu Glu Gly 
610 / 615 620 

Gin Asp Lys/Thr Ser Pro Asp Lys Gin Glu Lys Ser Pro Lys Pro Glu 



625 



630 



635 



Pro Ser /sn Thr Asp Gin Glu Ser Glu Glu Gin Gin Gin Phe Arg Asn 
645 650 655 

He Phi Lys Gin He Ala Gly Asp Asp Met Glu He Cys Ala Asp Glu 
660 665 670 

LeuAys Lys Val Leu Asn Thr Val Val Asn Lys His Lys Asp Leu Lys 
' 675 680 685 

(r His Gly Phe Thr Leu Glu Ser Cys Arg Ser Met He Ala Leu Met 
690 695 700 

'asp Thr Asp Gly Ser Gly Lys Leu Asn Leu Gin Glu Phe His His Leu 
705 710 715 720 



Trp Asn Lys lie Lys Ala Trp Gin Lys lie Phe Lys His Tyr Asp/nr 
725 730 7 

Asp Gin Ser Gly Thr lie Asn Ser Tyr Glu Met Arg Asn Ala/al Asn 
740 745 1 y 

Asp Ala Gly Phe His Leu Asn Asn Gin Leu Tyr Asp Ile/le Thr Met 
755 760 76^ 

Arg Tyr Ala Asp Lys His Met Asn lie Asp Phe Asp *fer Phe lie Cys 
770 775 780/ 

Cys Phe Val Arg Leu Glu Gly Met Phe Arg Ala /e His Ala Phe Asp 
785 790 795/ 

Lys Asp Gly Asp Gly lie lie Lys Leu Asn fl Leu Glu Trp Leu Gin 



805 

Leu Thr Met Tyr Ala 
820 

(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 821 amino acipTs 

(B) TYPE: amino acid 

(C) STRANDEDNESS: sir 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protei 



(ix) FEATURE: 

(A) NAME /KEY : Region 

(B) LOCATION: 1J. 61 
(D) OTHER INFORMATION : /label= ALU 

/note/ "FIGURE 3 - COW" 

(ix) FEATURE: 

(A) NAME /KEY : Region 

(B) LOCATION: 268. .309 

(D) OTHER INFORMATION: /label- ALU 
7 n ote- "FIGURE 3 - COW" 

(ix) FEATUF 

(A) /AME/KEY: Region 

(B) /LOCATION:310. .328 
(D/ OTHER INFORMATION: /label- ALU 

/note- "FIGURE 3 - COW" 

(ix) ^ATURE: 

(A) NAME /KEY : Region 

(B) LOCATION: 578. .618 
(D) OTHER INFORMATION: /label* ALU 

/note= "FIGURE 3 - COW" 

&4x) FEATURE: 

(A) NAME/KEY: Region 

(B) LOCATION: 619. . 653 
(D) OTHER INFORMATION: /label- ALU 



/note= "FIGURE 3 - COW" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

Met Pro Thr Val lie Ser Ala Ser Val Ala Pro Arg Thr Ma Ala Glu 
I 5 10 

Pro Arg Ser Pro Gly Pro Val Pro His Pro Ala Gin s/r Lys Ala Thr 
20 25 / 30 

Glu Ala Gly Gly Gly Asn Pro Ser Gly He Tyr Set Ala He He Ser 
35 40 / 45 

Arg Asn Phe Pro lie lie Gly Val Lys Glu Ly/ Thr Phe Glu Gin Leu 

/ 60 



50 



55 



His Lys Lys Cys Leu Glu Lys Lys Val Le/ Tyr Val Asp Pro Glu Phe 
65 70 / 75 8U 

Pro Pro Asp Glu Thr Ser Leu Phe Tyr/Ser Gin Lys Phe Pro lie Gin 
85 / 90 95 

Phe Val Trp Lys Arg Pro Pro Glu /le Cys Glu Asn Pro Arg Phe lie 
100 /105 I 10 

lie Asp Gly Ala Asn Arg Thr a/ P lie Cys Gin Gly Glu Leu Gly Asp 
115 A20 125 

Cys Trp Phe Leu Ala Ala i/ Ala Cys Leu Thr Leu Asn Gin His Leu 

130 fa 140 

Leu Phe Arg Val He Pr/His Asp Gin Ser Phe He Glu Asn Tyr Ala 
145 1^0 155 

Gly lie Phe His Phe/ln Phe Trp Arg Tyr Gly Glu Trp Val Asp Val 
16jy 170 J- ' ^ 

Val lie Asp Asp c/s Leu Pro Thr Tyr Asn Asn Gin Leu Val Phe Thr 
180 / 185 190 

Lys Ser Asn Hi/ Arg Asn Glu Phe Trp Ser Ala Leu Leu Glu Lys Ala 



195 



200 



Tyr Ala Lys/Leu His Gly Ser Tyr Glu Ala Leu Lys Gly Gly Asn Thr 
210 / 215 220 

Thr Glu /a Met Glu Asp Phe Thr Gly Gly Val Ala Glu Phe Phe Glu 
225 / 230 2 35 

lie Ar^Asp Ala Pro Ser Asp Met Tyr Lys lie Met Lys Lys Ala He 



245 



250 



Glu Arg Gly Ser Leu Met Gly Cys Ser lie Asp Asp Gly Thr Asn Met 

/ o*rn 265 210 



260 



L Tyr Gly Thr Ser Pro Ser Gly Leu Asn Met Gly Glu Leu lie Glu 
r 280 285 



/Arg 



275 

Met Val Arg Asn Met Asp Asn Ser Arg Leu Arg Asp Ser He Leu 
290 295 300 



Asp Pro Glu Val Ser Asp Asp Arg Pro Thr Arg Met lie Val ProAal 
305 310 315 / 

Gin Phe Glu Thr Arg Met Ala Cys Gly Leu Val Arg Gly HisAla Tyr 
325 330 / 335 

Ser Val Thr Gly Leu Glu Glu Ala Leu Tyr Lys Gly Glu/ys Val Lys 
340 345 / 350 

Leu Val Arg Leu Arg Asn Pro Trp Gly Gin Val Glu Trp Asn Gly Ser 



Arg 
355 



360 



365 



Trp Ser Asp Ser Trp Lys Asp Trp Ser Tyr Val A^p Lys Asp Glu Lys 
370 375 

Ala Arg Leu Gin His Gin Val Thr Glu Asp GVy Glu Phe Trp Met Ser 
385 390 /95 400 

Tyr Glu Asp Phe He Tyr His Phe Thr Ly/ Leu Glu He Cys Asn Leu 



405 



Thr Ala Asp Ala Leu Gin Ser Asp Lys/Leu Gin Thr Trp Thr Val Ser 
420 42/5 430 

Val Asn Glu Gly Arg Trp Val Arg/ly Cys Ser Ala Gly Gly Cys Arg 
435 44(V 445 

Asn Phe Pro Asp Thr Phe Trp Thr Asn Pro Gin Tyr Arg Leu Lys Leu 
450 455/ 460 

Leu Glu Glu Asp Asp Asp P/o Asp Asp Ser Glu Val He Cys Ser Phe 
465 470 / 475 480 

Leu Val Ala Leu Met Gl/ Lys Asn Arg Arg Lys Asp Arg Lys Leu Gly 
485 / 490 495 

Ala Ser Leu Phe Thr/le Gly Phe Ala He Tyr Glu Val Pro Lys Glu 
500 / 505 510 

Met His Gly Asn /ys Gin His Leu Gin Lys Asp Phe Phe Leu Tyr Asn 
515 / 520 525 

Ala Ser Lys a/s Arg Ser Lys Thr Tyr He Asn Met Arg Glu Val Ser 
530 / 535 540 

Gin Arg Ph/ Arg Leu Pro Pro Ser Glu Tyr Val He Val Pro Ser Thr 
545 / 550 555 560 

Tvr Glu/ro His Gin Glu Gly Glu Phe He Leu Arg Val Phe Ser Glu 
/ 565 570 575 



Lvs Atq Asn Leu Ser Glu Glu Val Glu Asn Thr He Ser Val Asp Arg 
y ^ y 580 585 590 

Pr/val Lys Lys Lys Lys Thr Lys Pro He He Phe Val Ser Asp Arg 
595 600 605 

(la Asn Ser Asn Lys Glu Leu Gly Val Asp Gin Glu Ser Glu Glu Gly 

610 615 620 



Lys Gly Lys Thr Ser Pro Asp Lys Gin Lys Gin Ser Pro Gin Pro Gl, 
625 630 635 *#v 

Pro Gly Ser Ser Asp Gin Glu Ser Glu Glu Gin Gin Gin Phe Arq/Asn 
645 650 65* 

lie Phe Lys Gin He Ala Gly Asp Asp Met Glu lie Cys Ala/ksp Glu 
660 665 67^ 

Leu Lys Lys Val Leu Asn Thr Val Val Asn Lys His Leu L V S 

6 75 680 68^ 

Thr His Gly Phe Thr Leu Glu Ser Cys Arg Ser Met Ae Ala Leu Met 
690 695 700/ 

Asp Thr Asp Gly Ser Gly Lys Leu Asn Leu Gin aiu Phe His His Leu 
705 710 715/ 

Trp Asn Lys lie Lys Ala Trp Gin Lys lie p/e Lys His Tyr Asp Thr 
^ 725 730 / 735 

Asp Gin Ser Gly Thr lie Asn Ser Tyr Gl/ Met Arg Asn Ala Val Asn 
740 745 / 750 

Asp Ala Gly Phe His Leu Asn Asn Gl/ Leu Tyr Asp lie lie Thr Met 
755 7 60 / 765 

Arg Tyr Ala Asp Lys His Met Asn/lle Asp Phe Asp Ser Phe lie Cys 
770 775 / 780 

Cys Phe Val Arg Leu Glu Gly /et Phe Arg Ala Phe His Ala Phe Asp 
785 790 / 7 ^5 yuu 

Lys Asp Gly Asp Gly He l/e Lys Leu Asn Val Leu Glu Trp Leu Gin 
805 / 810 o±S 

Leu Thr Met Tyr Ala 
820 



(2) INFORMATION FOR SEQ TO NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: Ao base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE/TYPE: cDNA 

(ix) FEATURE.- 

(A) NAME/KEY: misc_f eature 
(D) /OTHER INFORMATION: /label= Table 1 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

ATGGAGCCAA/ CAGAACTGAC 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i/ SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 21 base pairs 



(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Primer 

(ix) FEATURE: 

(A) NAME /KEY : misc_feature 

(D) OTHER INFORMATION: /label- Table 1 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

GTATGACTCG GAAAAGAAGGT / 

(2) INFORMATION FOR SEQ ID NO: 12: / 

(i) SEQUENCE CHARACTERISTICS: / 

(A) LENGTH: 20 base pairs / 

(B) TYPE: nucleic acid / 

(C) STRANDEDNESS : double / 

( D) TOPOLOGY : linear / 

(ii) MOLECULE TYPE: Primer / 

(ix) FEATURE: / 
(A) NAME /KEY : misc^featyfre 
(D) OTHER INFORMATION: /label= Table 1 

(xi) SEQUENCE DESCRIPTION : /SEQ ID NO: 12: 

TAAGCAAAAG CAGTCCCCAC / 

(2) INFORMATION FOR SEQ ID MO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2l/base pairs 

(B) TYPE: nuc/eic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TOTE: primer 

(ix) FEATURE: / 

(A) NAME /KEY : misc_feature 

(D) OT6&R INFORMATION: /label= Table 1 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

TTGCTGTTCC TCACTTTCCTG 

(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 
7(A) LENGTH: 20 base pairs 
/ (B) TYPE: nucleic acid 
/ (C) STRANDEDNESS: double 
/ (D) TOPOLOGY: linear 

(Ix) MOLECULE TYPE: primer 

/(ix) FEATURE: 




(A) NAME/KEY: misc_f eature 

(D) OTHER INFORMATION :/label= Table 1 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

GTTTCATCTG CTGCTTCGTT 

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: y 

(A) LENGTH: 20 base pairs / 

(B) TYPE: nucleic acid / 

(C) STRANDEDNESS: double / 

(D) TOPOLOGY: linear / 

(ii) MOLECULE TYPE: primer / 

(ix) FEATURE: / 
(A) NAME/KEY: misc_f eature / 
(D) OTHER INFORMATION: /laba4= Table 1 

(xi) SEQUENCE DESCRIPTION: SEQ/ID NO: 15: 

CTGGTTCAGG CATACATGGT / 

(2) INFORMATION FOR SEQ ID NO:/l6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nuclei^ acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: Zinear 

(ii) MOLECULE TYPE/ primer 

(ix) FEATURE: / 

(A) NAME/KEY: misc_f eature 

(D) OTHE/ INFORMATION: /label= Table 1 

(xi) SEQUENC/ DESCRIPTION: SEQ ID NO: 16: 



TTCTTTATGT GG#CCCTGAG TT 

(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 
7(A) LENGTH: 19 base pairs 
/ (B) TYPE: nucleic acid 
/ (C) STRANDEDNESS: double 
/ (D) TOPOLOGY: linear 

(/i) MOLECULE TYPE: primer 

/(ix) FEATURE: 

/ (A) NAME/KEY: misc_f eature 

/ (D) OTHER INFORMATION: /labels Table 1 

/ (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 



'ACGAACTGGA TGGGGAACT 



(2) INFORMATION FOR SEQ ID NO: 18 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: primer / 

(ix) FEATURE: / 
(A) NAME /KEY : misc_f eature / 
(D) OTHER INFORMATION: /label- TabLe 3 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

TTCAGTACCTCCCGTTCACC / 

(2) INFORMATION FOR SEQ ID NO: 19: / 

(i) SEQUENCE CHARACTERISTICS: / 

(A) LENGTH: 20 base pairs/ 

(B) TYPE: nucleic acid / 

(C) STRANDEDNESS: double/ 

(D) TOPOLOGY: linear / 

(ii) MOLECULE TYPE: primer / 

(ix) FEATURE: / 

(A) NAME /KEY : misc /feature 

(D) OTHER INFORMATION : /label= Table 3 

(xi) SEQUENCE DESCRIPt/oN: SEQ ID NO: 19: 

GAT GC T T G AGCC AG GAAAAC / 

(2) INFORMATION FOR SEc/ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH/ 22 base pairs 

(B) TYPE:/nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY : linear 

(ii) MOLECULyE TYPE: primer 

(ix) FEATURE: 

(A) /NAME/KEY: misc_f eature 

(D)/ OTHER INFORMATION: /label= Table 3 

(xi) SEC2UENCE DESCRIPTION: SEQ ID NO: 20: 

CTTTCCTTGAAGGTAGCTGTAT 

(2) INFORMATION FOR SEQ ID NO: 21 : 

(i/ SEQUENCE CHARACTERISTICS: 
/ (A) LENGTH: 20 base pairs 
/ (B) TYPE: nucleic acid 
/ (C) STRANDEDNESS: double 



(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: primer 

(ix) FEATURE: 

(A) NAME /KEY : misc_f eature 

(D) OTHER INFORMATION: /label= Table 3 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21 

GAGGTGCTGAGTGAGAGGAC 

(2) INFORMATION FOR SEQ ID NO: 22 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: primer 

(ix) FEATURE: 

(A) NAME/ KEY : misc__f eabure 
(D) OTHER INFORMATION yylabel= Table 3 

(xi) SEQUENCE DESCRIPTION :/SEQ ID NO: 22 

ACTCCGTCTCAAAAAAATACCT 

(2) INFORMATION FOR SEQ ID/NO:23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE t/pE: primer 

(ix) FEATURE 

(A) NA^E/KEY: misc_f eature 
(D) OT^HER INFORMATION: /label= Table 3 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:23 : 

ATTGTCCCTTTAGfCTCCTGG 

(2) INFORMATION FOR SEQ ID NO: 24 

(i) ^QUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(A) MOLECULE TYPE: primer 

(ix) FEATURE: 

(A) NAME/KEY: misc_f eature 
(D) OTHER INFORMATION: /label= Table 3 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24 : 
TGGAAGTAGGAGAGTGGGCA 

(2) INFORMATION FOR SEQ ID NO: 25 : / 

(i) SEQUENCE CHARACTERISTICS: / 

(A) LENGTH: 20 base pairs / 

(B) TYPE: nucleic acid / 

(C) STRANDEDNESS: double / 

(D) TOPOLOGY: linear / 

(ii) MOLECULE TYPE: primer / 

(ix) FEATURE: / 
(A) NAME /KEY : misc_f eature/ 
(D) OTHER INFORMATION: /laj6el= Table J 

(xi) SEQUENCE DESCRIPTION: SE0 ID NO:25 : 

GGGTAGATGGGTGGGAAGTT / 

(2) INFORMATION FOR SEQ ID NO/26 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 b^se pairs 

(B) TYPE: nucle/c acid 

(C) STRANDEDNES5S : double 

(D) TOPOLOGY: /linear 

(ii) MOLECULE TYP/: primer 

(ix) FEATURE: / 

(A) NAME^CEY: misc_f eature 

(D) OTHER INFORMATION: /labels Table 6 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:26 : 

GAGGAATGTGGAGGAAGGAC 

(2) I N FORMAT TON FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 
/(A) LENGTH: 20 base pairs 
/{B) TYPE: nucleic acid 
/ (C) STRANDEDNESS: double 
/ (D) TOPOLOGY: linear 

(j/) MOLECULE TYPE: primer 

/ix) FEATURE: 

/ (A) NAME/KEY: misc_f eature 

/ (D) OTHER INFORMATION: /label= Table 

/ (xi) SEQUENCE DESCRIPTION: SEQ ID NO:27: 

/tcctgtgagtgaggtctcg 
/ (2) information for seq id no: 28 : 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: primer / 

(ix) FEATURE: / 
(A) NAME /KEY : misc_f eature / 
(D) OTHER INFORMATION: /label= Tab^e 3 

(xi) SEQUENCE DESCRIPTION: SEQ ID NOy?8 : 

GGAACTCTGTGACCCCAAAT / 

(2) INFORMATION FOR SEQ ID NO: 29 : / 

(i) SEQUENCE CHARACTERISTICS: / 

(A) LENGTH: 20 base pairyS 

(B) TYPE: nucleic acid / 

(C) STRANDEDNESS: doub/e 

(D) TOPOLOGY: linear / 

(ii) MOLECULE TYPE: prime/ 

(ix) FEATURE: / 

(A) NAME /KEY : mi sTc_f eature 

(D) OTHER INFORMATION: /label= Table 3 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:29 : 

TCCTCAAACAAAACATTCGC / 

(2) INFORMATION FOR ^EQ ID NO: 30 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs . 

(B) TYRE : nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLEQULE TYPE: primer 

(ix) FEATURE: 

(A) NAME /KEY : misc_f eature 

/D) OTHER INFORMATION: /label= Table 3 

(xi) /eQUENCE DESCRIPTION: SEQ ID NO: 30: 

GTTCCCTyCATTCTCCATCG 

(2) INFORMATION FOR SEQ ID NO: 31 : 

/i) SEQUENCE CHARACTERISTICS: 
/ (A) LENGTH: 21 base pairs 

/ (B) TYPE: nucleic acid 

/ (C) STRANDEDNESS: double 

/ (D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: primer 



• 



(ix) FEATURE: 

(A) NAME /KEY : misc_f eature 

(D) OTHER INFORMATION: /label= Table 3 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31 : y 

GTTATTTCAACCCAGACCCTT / 

(2) INFORMATION FOR SEQ ID NO: 32 : / 

(i) SEQUENCE CHARACTERISTICS: / 

(A) LENGTH: 21 base pairs / 

(B) TYPE: nucleic acid / 

(C) STRANDEDNESS: double / 

(D) TOPOLOGY: linear / 

(ii) MOLECULE TYPE: primer / 

(ix) FEATURE: / 
(A) NAME/KEY: misc_featu/e 
(D) OTHER INFORMATION: //abel= Table 3 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32 : 

AATGGGTTCTCTGGTTACTGC / 

(2) INFORMATION FOR SEQ ID NO : 33 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 JSase pairs 

(B) TYPE: nucLieic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY/ linear 

(ii) MOLECULE TYBE: primer 

(ix) FEATURE: / 

(A) NAME/KEY: misc_f eature 

(D) OTH^R INFORMATION: /labels Table 3 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33 : 

AGCACGAAAAGCA^GATAAA 

(2) INFORMATION FOR SEQ ID NO: 34 : 

(i) SEQUENCE CHARACTERISTICS: 
fk) LENGTH: 20 base pairs 
/(B) TYPE: nucleic acid 
/(C) STRANDEDNESS: double 
/ (D) TOPOLOGY: linear 

(ii/ MOLECULE TYPE: primer 

(/x) FEATURE: 

/ (A) NAME/KEY: misc_f eature 
/ (D) OTHER INFORMATION: /label= Table 3 

/(xi) SEQUENCE DESCRIPTION: SEQ ID NO:34 : 



/ 




GTAAGAGATTTGCCCCCCAG 
(2) INFORMATION FOR SEQ ID NO: 35 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: primer 

(ix) FEATURE: 

(A) NAME /KEY : misc_f eature 
(D) OTHER INFORMATION: /labeW Table 3 

(xi) SEQUENCE DESCRIPTION: SEQ Il/NO:35 : 

TCTGCGGATCATTGGTTTTG / 

(2) INFORMATION FOR SEQ ID NO: 36/: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base/pairs 

(B) TYPE: nucleic AcLd 

(C) STRANDEDNESS :/ double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: /rimer 

(ix) FEATURE: / 

(A) NAME /KEY: misc_feature 

(D) OTHER INFORMATION: /label= Table 3 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 36 : 

ccttcccttcttcctgc/tc 

(2) information /for seq id no: 37 : 

(i) SEQUENCE CHARACTERISTICS: 
(A)/LENGTH: 20 base pairs 
(B/ TYPE: nucleic acid 
(Z) STRANDEDNESS: double 
fa) TOPOLOGY: linear 

(ii) yMOLECULE TYPE: primer 

(ix/ FEATURE: 

/ (A) NAME/KEY: misc_f eature 
/ (D) OTHER INFORMATION :/label= Table 3 

/xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37 : 

ct/tcttccccacccttacc 

(i) information for seq id no: 38 : 

/ (i) SEQUENCE CHARACTERISTICS: 

/ (A) LENGTH: 20 base pairs 

/ (B) TYPE: nucleic acid 





(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: primer 

(ix) FEATURE: 

(A) NAME/KEY: misc_f eature 

(D) OTHER INFORMATION: /labels Table 3 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38 : / 

CCTCCTCACCTGCTCCCATA / 

(2) INFORMATION FOR SEQ ID NO: 39 : / 

(i) SEQUENCE CHARACTERISTICS: / 

(A) LENGTH: 20 base pairs / 

(B) TYPE: nucleic acid / 

(C) STRANDEDNESS: double / 

(D) TOPOLOGY: linear / 

(ii) MOLECULE TYPE: primer / 

(ix) FEATURE: / 
(A) NAME /KEY : misc_f eatufce 
(D) OTHER INFORMATION: Mabel= Table 3 

(xi) SEQUENCE DESCRIPTION: S5EQ ID NO:39 : 

TTTTTCGGCTTAGACCCTCC / 

(2) INFORMATION FOR SEQ ID NO: 40 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 Vase pairs 

(B) TYPE: nuclesic acid 

(C) STRANDEDNBSS : double 

(D) TOPOLOGY J linear 

(ii) MOLECULE TYPg: primer 

(ix) FEATURE: / 

(A) NAME /KEY : misc_f eature 

(D) OTHEm INFORMATION: /label= Table 3 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40 : 

TGTGGGGAATAGAAATAAATGG 

(2) INFORMATION FOR SEQ ID NO: 41 : 

(i) SEQUENCE CHARACTERISTICS: 
fk) LENGTH: 19 base pairs 
/(B) TYPE: nucleic acid 
/(C) STRANDEDNESS: double 
/ (D) TOPOLOGY: linear 

(ii/ MOLECULE TYPE: primer 

{iA) FEATURE: 

7 (A) NAME/KEY: misc_f eature 



(D) OTHER INFORMATION: /label= Table 3 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41 : 
CCAGGAGCTCTGTGGGTCA 
(2) INFORMATION FOR SEQ ID NO: 42 : 

(i) SEQUENCE CHARACTERISTICS: / 

(A) LENGTH: 21 base pairs / 

(B) TYPE: nucleic acid / 

(C) STRANDEDNESS: double / 

(D) TOPOLOGY: linear / 

(ii) MOLECULE TYPE: primer / 

(ix) FEATURE: / 
(A) NAME /KEY : misc_f eature / 
(D) OTHER INFORMATION: /labyl= Table 3 

(xi) SEQUENCE DESCRIPTION: SEQ Ad NO: 42 : 

GGCTCCTCATCCTCATTCACA / 

(2) INFORMATION FOR SEQ ID NO : 4p : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base! pairs 

(B) TYPE: nucleic/acid 

(C) STRANDEDNESS \ double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: /rimer 

(ix) FEATURE: / 

(A) NAME/KE^ misc_feature 

(D) OTHER INFORMATION: /label= Table 3 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43: 

GTGGAGGAGGGTGAGTGTGC 

(2) INFORMATION TOR SEQ ID NO: 44 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) /LENGTH: 20 base pairs 

(B) /TYPE: nucleic acid 
(C/ STRANDEDNESS: double 
(f) TOPOLOGY: linear 

(ii) MOLECULE TYPE: primer 

(ix) /FEATURE: 

/ (A) NAME/KEY: misc_f eature 
/ (D) OTHER INFORMATION: /label= Table 3 

(/i) SEQUENCE DESCRIPTION: SEQ ID NO: 44 : 

TGTGGCAGGACAGGACGTTC 

(2V INFORMATION FOR SEQ ID NO: 45 : 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear y 

(ii) MOLECULE TYPE: primer / 

(ix) FEATURE: / 
(A) NAME/KEY: misc_f eature / 
(D) OTHER INFORMATION: /label= Table 3 

(xi) SEQUENCE DESCRIPTION: SEQ ID Nj/:45 : 

TTCAACCTCTGGAGTGGGCC / 

(2) INFORMATION FOR SEQ ID NO: 46 : / 

(i) SEQUENCE CHARACTERISTICS/ 

(A) LENGTH: 20 base pa/rs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS: do/ble 
(D) TOPOLOGY: lineat 

(ii) MOLECULE TYPE: pryfter 

(ix) FEATURE: / 

(A) NAME/KEY: rfisc__f eature 

(D) OTHER INFORMATION: /label= Table 3 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46 : 

CACCAGAGCAAACCGTCCACZ 

(2) INFORMATION FQ/R SEQ ID NO: 47 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) A'YPE: nucleic acid 

(C) / STRANDEDNESS: double 
(D/ TOPOLOGY: linear 

(ii) MC5LECULE TYPE: primer 

(ix) /FEATURE: 

/ (A) NAME /KEY : misc_f eature 
/ (D) OTHER INFORMATION: /labels Table 3 

(/i) SEQUENCE DESCRIPTION: SEQ ID NO: 47 : 

ACAtfCCCAGACTCCCATTCC 

(2/ INFORMATION FOR SEQ ID NO: 48 : 

/ (i) SEQUENCE CHARACTERISTICS: 
/ (A) LENGTH: 20 base pairs 

/ (B) TYPE: nucleic acid 

' (C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 




(ii) MOLECULE TYPE: primer 

(ix) FEATURE: 

(A) NAME /KEY : misc_f eature J 
(D) OTHER INFORMATION : / label= Table 3 / 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:48 : / 

TTCTCTTCTCCCTTCACCCT / 

(2) INFORMATION FOR SEQ ID NO: 49 : / 

(i) SEQUENCE CHARACTERISTICS: / 

(A) LENGTH: 22 base pairs / 

(B) TYPE: nucleic acid / 

(C) STRANDEDNESS: double / 

(D) TOPOLOGY: linear / 

(ii) MOLECULE TYPE: primer / 

(ix) FEATURE: / 

(A) NAME /KEY : misc_f eature 

(D) OTHER INFORMATION /label- Table 3 

(xi) SEQUENCE DESCRIPTIO^ SEQ ID NO: 49 : 

ACACACTTCATGCTCTCTACCC / 

(2) INFORMATION FOR SEQ I'D NO: 50: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: E0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLQJGY: linear 

(ii) MOLECULE /TYPE : primer 

(ix) FEATURE/ 

(A) NAME /KEY : misc_f eature 

(D) CITHER INFORMATION :/label= Table 3 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50 : 

ccgcctattcc/ttcctctt 

(2) information for seq id no: 51 : 

(i)/ SEQUENCE CHARACTERISTICS: 
/ (A) LENGTH: 20 base pairs 
/ (B) TYPE: nucleic acid 
/ (C) STRANDEDNESS: double 
/ (D) TOPOLOGY: linear 

/(ii) MOLECULE TYPE: primer 

/ (ix) FEATURE: 

/ (A) NAME/KEY: misc_f eature 

/ (D) OTHER INFORMATION: /label= Table 3 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51 




GACAAACTCCTGGGAAGCCT 
(2) INFORMATION FOR SEQ ID NO: 52 : 

SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double / 

(D) TOPOLOGY: linear / 

MOLECULE TYPE: primer / 

FEATURE : / 
(A) NAME /KEY : misc_feature / 
<D) OTHER INFORMATION: /label= /able 3 

SEQUENCE DESCRIPTION: SEQ I D JfiO : 5 2 : 

ACCTCTGACCCCTGTGAACC / 

(2) INFORMATION FOR SEQ ID NO: 53: / 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 20 base n6irs 

(B) TYPE: nucleic acad 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: pr/mer 

(ix) FEATURE: / 

(A) NAME/KEY/ misc_f eature 

(D) OTHER INFORMATION: /label= Table 3 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 53 : 

TGTGGATTTGTGTGCTAGGC 

(2) INFORMATION yFOR SEQ ID NO:54 : 

(i) SEQUENCE CHARACTERISTICS: 
(A)/LENGTH: 21 base pairs 
{BA TYPE: nucleic acid 
{£) STRANDEDNESS: double 
/d) TOPOLOGY: linear 

( ii ) ^MOLECULE TYPE: primer 
(i/) FEATURE: 

/ (A) NAME /KEY : misc_f eature 
/ (D) OTHER INFORMATION: /label= Table 3 

/ (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54 : 

CATAAATAGCACCGACAGGGA 

/(2) INFORMATION FOR SEQ ID NO: 55 : 

' (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 




(ii) 
(ix) 



(xi) 



(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: primer 

(ix) FEATURE: 

(A) NAME /KEY : misc_f eature 

(D) OTHER INFORMATION: /label= Table 3 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55 

GGGATGGAGAAGAGTGAGGA 

(2) INFORMATION FOR SEQ ID NO: 56 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: primer 

(ix) FEATURE: 

(A) NAME /KEY : misc_f e&ture 
(D) OTHER INFORMATION: /label- Table 3 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 56 

TCCTCACTCTTCTCCATCCC 

(2) INFORMATION FOR SEO/lD NO: 57 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH/ 19 base pairs 

(B) TYPE:/nucleic acid 

(C) STRANDEDNESS: double 

(D) TOROLOGY : linear 

(ii) MOLECULE TYPE: primer 

(ix) FEATURE: 

{M NAME /KEY : misc_f eature 
)) OTHER INFORMATION: /label= Table 3 

(xi) /eQUENCE DESCRIPTION: SEQ ID NO: 57 : 

ACCCTGTATGTTGCCTTGG 

(2) INFORMATION FOR SEQ ID NO: 58 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: primer 



(ix) 



FEATURE: 




(A) NAME/KEY: misc_f eature 

(D) OTHER INFORMATION : /label= Table 3 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 58 : 

GGGGATTTTGCTGTGTGCTG 

(2) INFORMATION FOR SEQ ID NO: 59 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double / 

( D) TOPOLOGY : linear / 

(ii) MOLECULE TYPE: primer / 

(ix) FEATURE: / 
(A) NAME/KEY: misc_f eature / 
(D) OTHER INFORMATION: /label=^ Table 3 

(xi) SEQUENCE DESCRIPTION: SEQ ICrNO:59 : 

ATTCCTGCTCCCACCGTCTC / 

(2) INFORMATION FOR SEQ ID NO:60/ 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 baser pairs 

(B) TYPE : nucleic/acid 

(C) STRANDEDNESS/ double 

( D) TOPOLOGY : l/near 

(ii) MOLECULE TYPE \/ primer 

(ix) FEATURE: / 

(A) NAME/KEY: misc_f eature 

(D) OTHERr INFORMATION : /label= Table 3 

(xi) SEQUENCE/DESCRIPTION: SEQ ID NO: 60 : 

CACAGAGTGTCCGAQAGGCA 

(2) I N FORMAT ION FOR SEQ ID NO: 61 : 

(i) SEQUENCE CHARACTERISTICS: 
/(A) LENGTH: 22 base pairs 
/ (B) TYPE: nucleic acid 
/ (C) STRANDEDNESS: double 
/ (D) TOPOLOGY: linear 

(/i) MOLECULE TYPE: primer 

/ (ix) FEATURE: 
/ (A) NAME /KEY : misc_f eature 

/ (D) OTHER INFORMATION: /label= Table 3 

/ (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 61 : 



GGAGATTATCAGGTGAGATGCC 



(2) INFORMATION FOR SEQ ID NO: 62 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: primer 

(ix) FEATURE: 

(A) NAME /KEY : misc_feature 
(D) OTHER INFORMATION: /label= Table 3 y 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 62 

CAGAGTGTCC GAG AG G C AG G G 

(2) INFORMATION FOR SEQ ID NO: 63 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: primer 

(ix) FEATURE: 

(A) NAME/KEY: misc /feature 
(D) OTHER INFORMATION: /label= Table 3 

(xi) SEQUENCE DESCRIP'JtfON : SEQ ID NO: 63 

CGTTGACCCCTCCACCTTGA 

(2) INFORMATION FOR SE)£ ID NO: 64 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTJl: 20 base pairs 

(B) TYPE/ nucleic acid 

(C) STRANDEDNESS: double 

(D) TOCOLOGY: linear 

(ii) MOLECULE TYPE: primer 

(ix) FEATORE: 

{M NAME/KEY: misc_feature 
(p) OTHER INFORMATION: /label= Table 3 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 64 : 

GGGAAAAOATGCACCTTCTT 

(2) IMfORMATION FOR SEQ ID NO: 65: 

'(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



# 



(ii) MOLECULE TYPE: primer 

(ix) FEATURE: 

(A) NAME/KEY: misc_f eature 

(D) OTHER INFORMATION: /label= Table 3 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 65 : 

TAGGGGGTAAAATGGAGGAG 

(2) INFORMATION FOR SEQ ID NO: 66 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: primer 

(ix) FEATURE: 

(A) NAME /KEY : misc/ feature 
(D) OTHER INFORMATION: /label= Table 3 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 66 

ACTAACTCAGTGGAATAGGG 

(2) INFORMATION FOR/SEQ ID NO: 67 : 

( i ) SEQUENCE/ CHARACTERISTICS : 

(A) LENGTH: 20 base pairs 

(B) T7YPE : nucleic acid 

(C) /STRANDEDNESS: double 
(D/ TOPOLOGY: linear 

(ii) MOLECULE TYPE: primer 

(ix/ FEATURE: 

(A) NAME /KEY : misc_f eature 

(D) OTHER INFORMATION: /label= Table 3 

r (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 67 : 
SAGCTAGGATAGCTCAAT 



